April 01, 2025

Amazon AGI Labs Unveils Nova Act AI Agent System That Can Control Browsers To Perform Tasks

In Brief

Amazon AGI Labs has unveiled Nova Act, an AI model designed to perform tasks within a web browser, and has released a research preview of its SDK so that developers can experiment with an early version of the model.

Amazon AGI Labs, the company’s division focused on advancing artificial general intelligence (AGI), has unveiled Amazon Nova Act, a new AI model designed to perform tasks within a web browser.

In conjunction with this, Amazon AGI Labs has released a research preview of the Amazon Nova Act software development kit (SDK), which will allow developers to experiment with an early version of the model. Through this SDK, developers can create agents capable of completing a variety of tasks in a web browser, such as submitting an out-of-office request in an internal system, setting calendar holds, or sending “away from office” email notifications.

The Nova Act SDK lets developers break complex workflows into smaller, manageable commands, such as searching, checking out, or answering questions based on what appears on the screen. Developers can add detailed instructions to these commands (e.g., “do not accept the insurance upsell”), call APIs, and use Playwright to manipulate the browser directly, which improves reliability in tasks like entering passwords. The SDK also allows Python code to be interleaved with agent commands, so developers can add tests, breakpoints, and assertions, or use thread pools to run sessions in parallel, since even the fastest agents are limited by web page load times.
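The decomposition described above can be sketched in Python. This is a minimal illustration, not the Nova Act SDK itself: `StubAgent`, its `act` method, `book_rental`, `run_in_parallel`, and the example URLs are all hypothetical stand-ins, and the stub only records commands so that the example runs without a browser or an API key. It shows small commands issued in sequence, an ordinary assertion between agent steps, and a thread pool that runs independent sessions in parallel.

```python
from concurrent.futures import ThreadPoolExecutor


class StubAgent:
    """Hypothetical stand-in for a browser agent; it only records commands."""

    def __init__(self, start_url):
        self.start_url = start_url
        self.history = []

    def act(self, command):
        # A real agent would drive the browser here; the stub just logs.
        self.history.append(command)
        return f"done: {command}"


def book_rental(site):
    """Run one workflow as a series of small, checkable commands."""
    agent = StubAgent(site)
    agent.act("search for a compact car for next weekend")
    agent.act("select the cheapest result")
    # Detailed instructions can ride along inside a single command.
    result = agent.act("check out, and do not accept the insurance upsell")
    # Plain Python assertions can sit between agent steps.
    assert result.startswith("done")
    return site, len(agent.history)


def run_in_parallel(sites):
    """Run independent sessions concurrently in a thread pool."""
    with ThreadPoolExecutor(max_workers=max(1, len(sites))) as pool:
        # pool.map returns results in the same order as the inputs.
        return list(pool.map(book_rental, sites))


if __name__ == "__main__":
    sites = ["https://rentals-a.example", "https://rentals-b.example"]
    for site, steps in run_in_parallel(sites):
        print(site, "completed", steps, "steps")
```

With a real agent, each `act` call would wait on the browser, which is why running separate sessions on separate threads can shorten total wall-clock time.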

Nova Act: A Reliable AI Model Aimed At Over 90% Accuracy For Complex Web Interactions

Nova Act is designed to provide reliable building blocks that can be combined into more complex workflows. Many agent benchmarks measure high-level tasks, on which state-of-the-art models typically complete only 30% to 60% of tasks in web browsers. Nova Act instead focuses on the reliability of individual steps. Amazon AGI Labs aims for over 90% accuracy in internal evaluations of capabilities that often trip up other models, such as date picking, dropdown menus, and popups. The model is also engineered to excel on benchmarks like ScreenSpot and GroundUI Web, which assess an AI’s ability to interact with the web. For example, the model scores 0.939 on ScreenSpot’s web text task (interacting with textual elements on screenshots), 0.879 on its web icon task (interacting with visual elements), and 0.805 on GroundUI Web (understanding and engaging with various UI elements on web pages).
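To see why per-step reliability matters once building blocks are chained together, consider a simplified model in which every step succeeds independently with the same probability. The independence assumption, the function name `workflow_success_rate`, and the step counts below are illustrative choices made for this sketch, not figures or methods reported by Amazon.

```python
def workflow_success_rate(step_accuracy, num_steps):
    """Probability that every step succeeds, assuming independent steps."""
    if not 0.0 <= step_accuracy <= 1.0:
        raise ValueError("step_accuracy must be between 0 and 1")
    if num_steps < 0:
        raise ValueError("num_steps must be non-negative")
    return step_accuracy ** num_steps


if __name__ == "__main__":
    for accuracy in (0.60, 0.90, 0.99):
        for steps in (1, 5, 10):
            rate = workflow_success_rate(accuracy, steps)
            print(f"step accuracy {accuracy:.2f}, {steps:2d} steps: {rate:.3f}")
```

Under this model, a 10-step workflow built from steps that each succeed 90% of the time finishes only about 35% of the time (0.9 to the 10th power is roughly 0.349), which suggests why small gains in per-step accuracy compound in longer workflows.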

Nova Act is also built for unattended use. Once an agent has been configured and works reliably, it does not need constant oversight. Users can run it in headless mode, expose it as an API that integrates with other systems, or set it to run asynchronously on a specified schedule.

Furthermore, although the model is still in its early stages, Amazon AGI Labs is optimistic about Nova Act’s ability to transfer its understanding of user interfaces across different environments. Notably, early checkpoints suggest that Nova Act performs well in novel settings, such as web games, even without prior experience with video games.

Additionally, with its combination of reliable building blocks and flexibility, Nova Act is already being built into Alexa+ to navigate the web autonomously and complete tasks when integrated services lack the necessary APIs.

Nova Act represents the first step in Amazon AGI Labs’ vision to develop the key capabilities needed for scalable, effective agents. This initial checkpoint is part of a larger training curriculum that aims to improve the model. To make agents truly intelligent and reliable for complex, multi-step tasks, Amazon AGI Labs believes that agents must be trained using reinforcement learning in a diverse set of real-world environments, rather than relying solely on supervised fine-tuning with simple demonstrations. The team is eager to share further research and progress as the model evolves.

About The Author

Alisa, a dedicated journalist at the MPost, specializes in cryptocurrency, zero-knowledge proofs, investments, and the expansive realm of Web3. With a keen eye for emerging trends and technologies, she delivers comprehensive coverage to inform and engage readers in the ever-evolving landscape of digital finance.

Alisa Davidson