Best in Technology

Anthropic Wants Its AI Agent to Control Your Computer

By News Room | 22 October 2024 | 3 Mins Read

Demos of AI agents can seem stunning, but getting the technology to perform reliably and without annoying (or costly) errors in real life can be a challenge. Current models can answer questions and converse with almost humanlike skill, and are the backbone of chatbots such as OpenAI’s ChatGPT and Google’s Gemini. They can also perform tasks on computers when given a simple command by accessing the computer screen as well as input devices like a keyboard and trackpad, or through low-level software interfaces.

Anthropic says that Claude outperforms other AI agents on several key benchmarks including SWE-bench, which measures an agent’s software development skills, and OSWorld, which gauges an agent’s capacity to use a computer operating system. The claims have yet to be independently verified. Anthropic says Claude performs tasks in OSWorld correctly 14.9 percent of the time. This is well below humans, who generally score around 75 percent, but considerably higher than the current best agents—including OpenAI’s GPT-4—which succeed roughly 7.7 percent of the time.

Anthropic claims that several companies are already testing the agentic version of Claude. This includes Canva, which is using it to automate design and editing tasks, and Replit, which uses the model for coding chores. Other early users include The Browser Company, Asana, and Notion.

Ofir Press, a postdoctoral researcher at Princeton University who helped develop SWE-bench, says that agentic AI tends to lack the ability to plan far ahead and often struggles to recover from errors. “In order to show them to be useful we must obtain strong performance on tough and realistic benchmarks,” he says, such as reliably planning a wide range of trips for a user and booking all the necessary tickets.

Jared Kaplan, Anthropic's chief science officer, notes that Claude can already troubleshoot some errors surprisingly well. Faced with a terminal error while trying to start a web server, for instance, the model knew how to revise its command to fix it. It also worked out that it had to enable popups when it ran into a dead end browsing the web.

Many tech companies are now racing to develop AI agents as they chase market share and prominence. In fact, it might not be long before many users have agents at their fingertips. Microsoft, which has poured upwards of $13 billion into OpenAI, says it is testing agents that can use Windows computers. Amazon, which has invested heavily in Anthropic, is exploring how agents could recommend and eventually buy goods for its customers.

Sonya Huang, a partner at the venture firm Sequoia who focuses on AI companies, says that for all the excitement around AI agents, most companies are really just rebranding AI-powered tools. Speaking to WIRED ahead of the Anthropic news, she says that the technology currently works best when applied in narrow domains such as coding-related work. “You need to choose problem spaces where if the model fails, that’s okay,” she says. “Those are the problem spaces where truly agent native companies will arise.”

A key challenge with agentic AI is that errors can be far more problematic than a garbled chatbot reply. Anthropic has imposed certain constraints on what Claude can do, for example by limiting its ability to use a person’s credit card to buy stuff.

If errors can be avoided well enough, says Press of Princeton University, users might learn to see AI—and computers—in a completely new way. “I’m super excited about this new era,” he says.
