Google’s new AI agent will do your web browsing—Is this the end of clicks?
Technology#HRTech#HRCommunity#Artificial Intelligence
In a groundbreaking move, Google has unveiled its first-ever AI agent, Project Mariner, designed to take actions on the web without user input. Powered by Gemini, Google’s advanced AI system, Project Mariner is a research prototype developed by the company’s DeepMind division. The new AI agent can perform tasks that typically require human interaction with websites, such as filling out forms, clicking buttons, and navigating through web pages, all while controlling a Chrome browser. This innovation marks a significant shift in how users may interact with the web in the future.
Google began rolling out the new AI agent to a small group of preselected testers on Wednesday, offering an exclusive first look at its capabilities. This experiment forms part of Google’s ongoing efforts to explore how its Gemini AI system can enhance user experiences by reading, summarizing, and now using websites autonomously. According to a Google executive, this marks the beginning of a "fundamentally new UX paradigm shift," which shifts the user’s role from directly interacting with websites to relying on an AI agent that takes over these tasks.
In a demonstration with TechCrunch, Jaclyn Konzelmann, Director of Google Labs, showcased how Project Mariner operates. After installing an extension in Chrome, users can activate a chat window beside their browser, instructing the AI to perform various tasks. For instance, users can ask the agent to "create a shopping cart from a grocery store based on this list." In the demo, the AI agent navigated to a grocery store’s website, Safeway, and added items to a shopping cart based on the provided list. While impressive, the agent's actions were not without limitations—there was a noticeable delay of about five seconds between each cursor movement, and the agent occasionally stopped to request clarifications about specific items.
Project Mariner, however, is not yet capable of completing transactions. It cannot check out items, process payments, or fill in credit card and billing information. Google has intentionally restricted the agent from performing such tasks to provide users with greater control over their personal information. Additionally, the agent avoids accepting cookies or signing terms of service agreements, ensuring privacy and security for users. The agent takes screenshots of the user’s browser window for processing, which are sent to the Gemini cloud for analysis before executing the required actions.
The AI agent's potential to assist in everyday tasks like shopping for groceries, booking flights, or finding recipes is clear, but it comes with some caveats. Project Mariner currently only works within the active tab of a Chrome browser, meaning users must watch the agent perform tasks in real time and cannot use their computer for other activities while the agent is at work. This decision, according to Google DeepMind’s Chief Technology Officer Koray Kavukcuoglu, is to ensure that users are aware of exactly what the AI is doing on their behalf. "It’s complementary. You, as an individual, can use websites, and now your agent can do everything that you do on a website as well," Kavukcuoglu explained.
For website owners, there’s a silver lining: since Project Mariner operates within the user’s browser, businesses and publishers still have their pages viewed by users, maintaining traffic and engagement. However, the AI agent could reduce the amount of time users spend interacting with the site itself, potentially altering the web's dynamics. Google acknowledges that this shift could have far-reaching effects on how businesses approach user experience and how they design web interactions in the future.
Beyond Project Mariner, Google also introduced other AI agents tailored to specific tasks. The Deep Research agent helps users explore complex topics by generating multistep research plans and generating detailed reports on the findings. Google also launched Jules, an AI agent designed to assist developers by integrating directly into GitHub workflows. Additionally, Google DeepMind is working on an AI agent to assist with navigating video games, building on the company's extensive history in game-playing AI.
While it remains unclear when Project Mariner will be made available to Google’s broader user base, its release will likely have a profound impact on the way people engage with the web. By automating tasks traditionally carried out by users, Google’s new AI agent could change the standard for web interaction, potentially rendering human engagement with websites a thing of the past. As Google continues to refine Project Mariner and its other AI offerings, the broader implications for business, privacy, and user autonomy will become clearer in the coming months.