Google’s latest AI model uses a web browser like you do

The new Gemini 2.5 Computer Use model can click, scroll, and type in a browser window to access data that’s not available via an API.

Google is previewing a new Gemini AI model designed to navigate and interact with the web via a browser, letting AI agents do things inside interfaces designed for use by people and not robots. The model, called Gemini 2.5 Computer Use, uses “visual understanding and reasoning capabilities” to analyze a user’s request and carry out a task, such as filling out and submitting a form.

It can be used for UI testing or navigating interfaces made for people who don’t have an API or other direct connection available. Other versions of this model have been used for agentic features in AI Mode and Project Mariner, a research prototype that uses AI agents to carry out tasks on its own in a browser, like adding items to your cart based on a list of ingredients.

Google’s announcement comes just one day after OpenAI revealed new apps for ChatGPT as part of its annual Dev Day, and continues to focus its attention on its ChatGPT Agent feature that can complete complex tasks on your behalf. Meanwhile, Anthropic had already released a version of its Claude AI model with “computer use” last year.

Google posted some demo videos showing its computer use tool in action, and notes that they are sped up 3x.

Google says its computer use model “outperforms leading alternatives on multiple web and mobile benchmarks.” Unlike ChatGPT Agent and Anthropic’s computer use tool, Google’s new AI model only has access to a browser — not an entire computer environment. Google notes that it shows “it is not yet optimized for desktop OS-level control” and currently supports 13 actions, including opening a web browser, typing text, as well as dragging and dropping elements.

Gemini 2.5 Computer Use is available to developers through Google AI Studio and Vertex AI, but there’s also a demo on Browserbase, where you watch as it completes tasks, like “Play a game of 2048” or “Browse Hacker News for trending debates.

Source: www.theverge.com

Latest

Saudi Exchange shows resilience on Aramco shares, defying regional trend

RIYADH: Saudi Arabia’s Tadawul All Share Index showed signs...

Info Edge commits Rs 250 crore to new B8 Fund I to back growth-stage tech startups in India

Info Edge has approved a commitment of up to...

Scoop confirmed: AI platform MeltPlan raises $10 million to make construction boring

MeltPlan, a pre-construction AI platform, today said it has...

Indian agentic AI startup Gushwork raises $9 million to expand engineering teams

Gushwork, an agentic AI startup raised a $9 million...
the financial
the financial
Top platform for impactful conferences, news, and networking opportunities. Stay Connected. Stay Informed. Stay Ahead with The Financial

Saudi Exchange shows resilience on Aramco shares, defying regional trend

RIYADH: Saudi Arabia’s Tadawul All Share Index showed signs of resilience on March 2, buoyed by gains in Saudi Aramco, even as the Middle...

Info Edge commits Rs 250 crore to new B8 Fund I to back growth-stage tech startups in India

Info Edge has approved a commitment of up to Rs 250 crore to B8 Fund I, a newly launched scheme under B8 Trust, marking...

Scoop confirmed: AI platform MeltPlan raises $10 million to make construction boring

MeltPlan, a pre-construction AI platform, today said it has raised $10 million in a Seed funding round led by Bessemer Venture Partners, with participation from noa. The...