The Browser Becomes the Agent: Why Search Starts to Act The Browser Becomes the Agent: Why Search Starts to Act

For years, search engines were built to find information and send you to the desired page. Now they are learning to do some of the work too. In 2025 and 2026, Google, OpenAI, Perplexity and Microsoft all pushed products that can compare options, fill forms, book tables, search across many sites, and keep a task moving inside the browser. AI search is becoming more active, more task-driven, and much closer to a real browser agent.

The browser is where digital life already happens. We shop there, plan trips there, read reviews there, log in there, and often end up drowning in tabs there. An AI model that can see a page, understand your goal, and click through steps has a clear advantage over a chatbot that only writes a nice summary.

AI Search Is Moving Into Action

Google’s own numbers show how fast the answer layer is growing. AI Overviews reached more than 1 billion monthly users in October 2024, passed 1.5 billion per month by spring 2025, and went beyond 2 billion monthly users by July 2025. Google also said these AI results drove more than 10% growth in the kinds of queries where they appear, and AI Mode had more than 100 million monthly active users in the U.S. and India by mid-2025.

User behavior is changing too. In April 2025, Google said AI Mode queries were about twice as long as classic searches. By early 2026, it said they had become three times as long, with more follow-up questions and more voice or image input. People are writing search prompts more like requests to an assistant: full context, clear goals, and a hope that the machine will handle the boring part.

Why the Browser Agent Makes Sense

The browser is the best place for this next step because the web is already built as a giant place for actions. Websites have buttons, text fields, calendars, filters, carts, maps and checkout flows. OpenAI’s Computer-Using Agent was trained to work with those visible elements, like a human using a mouse and keyboard.

Google’s Project Mariner works in a similar way: it observes what is on the page, makes a plan, and acts while keeping the user informed.

Google Is Pushing Search Closer to a Browser Agent

Google has been direct about this change. AI Mode uses a “query fan-out” method that breaks a question into many sub-queries and runs them in parallel. Google says Deep Search can push that further by issuing hundreds of searches, combining different sources, and building a cited report in minutes.

Then come the action features. Google said AI Mode can use Project Mariner’s browsing abilities to find restaurant reservations, local appointments, event tickets and shopping options. In its examples, the system checks multiple sites, analyzes live availability, fills in tedious form steps and hands the user a near-finished result.

Google also says AI Mode can browse a Shopping Graph of more than 50 billion product listings, show virtual try-on results and handle AI-assisted checkout with Google Pay when a target price is met, with user oversight.

The browser itself is now becoming part of that shift. In September 2025, Google started bringing AI Mode into Chrome’s address bar and side panel, so people could ask questions about the page they were already viewing without leaving it. Search is no longer waiting for neatly typed keywords. It is becoming visual, conversational and more willing to stay with the user through a task.

OpenAI, Perplexity, and Microsoft Want the Same Future

Google is not alone. OpenAI launched Operator in early 2025 as a browser-using agent, then expanded the idea with ChatGPT agent and Atlas. The ChatGPT agent combines research tools with a visual browser, a text browser, a terminal and connectors to apps. Atlas turns the browser itself into the product: ChatGPT sits in every tab, can use browsing context and can trigger agent mode while you work.

Perplexity is betting on the same idea from a different angle. Its Comet browser describes itself as a personal AI assistant that can handle research, shopping, finance, travel planning and inbox work. Comet is now available on Mac, Windows, iOS and Android. Microsoft has also moved from answers toward tasks. Copilot Actions can help users book hotels, make dinner reservations, order flowers, find flights and schedule tours from the browser.

Across the industry, the same idea keeps resurfacing: the best way to win the future of search may be to own the moment after the answer, when the user wants something done.

How AI Search Changes SEO

Traditional SEO aimed to win the click from a ranked results page. AI search adds a second contest: being selected, cited or used inside the answer itself. That is why marketers now talk about GEO, or generative engine optimization, and AEO, or answer engine optimization. The labels may change, but the pressure is the same. Content now has to rank, be understood and be reusable by AI systems.

The traffic picture is mixed, and that matters for publishers. Google says AI search still sends billions of clicks to the web every day, and that clicks from AI Overviews tend to be higher quality, with users spending more time on the site. Independent research is less cheerful. Ahrefs said that, in its 2026 update, the presence of an AI Overview was linked to a 58% lower click-through rate for the top-ranking page. For publishers and marketers, this changes how content is found, cited, and clicked.

At the same time, AI can create new discovery paths. Similarweb said AI platforms generated more than 1.1 billion referral visits in June 2025, up 357% year over year, and that referrals to shopping and other transactional sites converted at about 7%. It also found that roughly 95% of ChatGPT users still rely on Google. So the smarter read is not “Google is over.” The real story is a wider discovery market where old search, AI answers and browser agents overlap.

Privacy, Mistakes, and a Web That Pushes Back

There is a price for convenience, and the clearest risk is loss of control. Browser agents often need screenshots, browsing history, saved sessions and page context to work well. The ChatGPT agent uses screenshots of its virtual browser window to see pages, and chats, browsing history, and screenshots stay in the conversation history until the user deletes them. It also says take-over mode hides sensitive inputs while the user types.

Vendors know trust is the weak point, so they are adding guardrails. OpenAI says Atlas cannot install extensions, cannot access other apps or the local file system, and pauses on some sensitive financial actions. The operator requires human confirmation for purchases, emails and deleted calendar events. Google says AI Mode personalization is optional and under user control. All of that is sensible. It is also a quiet admission that browser agents can still make mistakes or get tricked by malicious instructions hidden on webpages.

The open web may also resist. In March 2026, Amazon won a court order blocking access for Perplexity’s shopping agent, arguing that the system allegedly entered customer accounts in a hidden way and created security risks. We looked at that clash in detail in our earlier piece on Amazon vs. Perplexity, because the case goes beyond one lawsuit: it shows how AI shopping agents could run into platform rules, privacy concerns, and a growing fight over who controls online buying.

What the Future of AI Search Means for You

Search is moving from a tool that finds pages to a tool that helps finish tasks. That will likely make everyday research faster, reduce tab overload, and push more buying, booking and planning into AI-driven flows. It will also shift power toward the products that own user context: the browser, the assistant, the account, the payment layer, and the memory of what you were trying to do five minutes ago.

For readers, a simple rule works well:

  • Use agentic search for repetitive tasks with clear rules, like product comparison, trip planning or source gathering.
  • Slow down when money, private data, health, or legal choices are involved.
  • Keep an eye on settings for browser memory, screenshots, and account connections.

The browser may be becoming the agent, but it still needs supervision. Think of it as a very fast intern with screen access. Useful? Very. Ready to run the company alone? Not today.

Author's other posts

Nebius Plans $10B AI Data Center in Finland Amid Europe AI Race
Article
Nebius Plans $10B AI Data Center in Finland Amid Europe AI Race
Nebius plans a 310 MW AI data center in Finland. Here is why the Lappeenranta project matters for Europe’s AI race, infrastructure, and sovereignty.
Apple Maps Ads Could Reshape Local Search
Article
Apple Maps Ads Could Reshape Local Search
Apple Maps ads could reshape local search for brands and small businesses, changing how users discover places through Maps, Siri, and Apple Business.
Reliable AI Knows When to Say: “This Makes No Sense”
Article
Reliable AI Knows When to Say: “This Makes No Sense”
BullshitBench shows why reliable AI must detect broken premises, not just generate fluent answers. A look at AI reliability, pushback, and false premise detection.
Anthropic Wins First Round in Pentagon AI Clash
Article
Anthropic Wins First Round in Pentagon AI Clash
Anthropic wins an early court battle in its Pentagon AI dispute, raising bigger questions about military AI guardrails, contracts, and control.