Close Menu
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On
Vaping Is ‘Everywhere’ in Schools—Sparking a Bathroom Surveillance Boom

Vaping Is ‘Everywhere’ in Schools—Sparking a Bathroom Surveillance Boom

19 November 2025
‘Odd Lots’ Cohost Joe Weisenthal Has Predictions About How the AI Bubble Will Burst

‘Odd Lots’ Cohost Joe Weisenthal Has Predictions About How the AI Bubble Will Burst

18 November 2025
Yuji Horii On Making Dragon Quest Games: ‘I Do Think I Will Work On It Until I Die’

Yuji Horii On Making Dragon Quest Games: ‘I Do Think I Will Work On It Until I Die’

18 November 2025
Facebook X (Twitter) Instagram
Just In
  • Vaping Is ‘Everywhere’ in Schools—Sparking a Bathroom Surveillance Boom
  • ‘Odd Lots’ Cohost Joe Weisenthal Has Predictions About How the AI Bubble Will Burst
  • Yuji Horii On Making Dragon Quest Games: ‘I Do Think I Will Work On It Until I Die’
  • This Quest 3S Bundle Is $50 Off and Includes a Game and Gift Card
  • 4 Clever Tricks That Make It Worth Switching to Proton Mail
  • Microsoft’s Agent 365 Tries to Be the AI Bot Boss
  • Analogue 3D, Which Can Play Any N64 Cartridge, Continues The Company’s Stellar Reputation
  • Gemini 3 Is Here—and Google Says It Will Make Search Smarter
Facebook X (Twitter) Instagram Pinterest Vimeo
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release
Subscribe
Best in TechnologyBest in Technology
Home » AI Agents Are Terrible Freelance Workers
News

AI Agents Are Terrible Freelance Workers

News RoomBy News Room29 October 20253 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
AI Agents Are Terrible Freelance Workers
Share
Facebook Twitter LinkedIn Pinterest Email

Even the best artificial intelligence agents are fairly hopeless at online freelance work, according to an experiment that challenges the idea of AI replacing office workers en masse.

The Remote Labor Index, a new benchmark developed by researchers at data annotation company Scale AI and the Center for AI Safety (CAIS), a nonprofit, measures the ability of frontier AI models to automate economically valuable work.

The researchers gave several leading AI agents a range of simulated freelance work and found that even the best could perform less than 3 percent of the work, earning $1,810 out of a possible $143,991. The researchers looked at several tools and found the most capable to be Manus from a Chinese startup of the same name, followed by Grok from xAI, Claude from Anthropic, ChatGPT from OpenAI, and Gemini from Google.

“I should hope this gives much more accurate impressions as to what’s going on with AI capabilities,” says Dan Hendrycks, director of CAIS. He adds that while some agents have improved significantly over the past year or so, that does not mean that this will continue at the same rate.

Spectacular AI advances have led to speculation about AI soon surpassing human intelligence and replacing vast numbers of workers. In March, Dario Amodei, CEO of Anthropic, suggested that 90 percent of coding work would be automated within a matter of months.

Previous waves of AI have inspired misplaced predictions about job displacement, for example concerning the imminent replacement of radiologists with AI algorithms.

The researchers generated a range of freelance tasks through verified Upwork workers. The tasks span a range of work including graphic design, video editing, game development, and administrative chores like scraping data. They combined a description of each job with a directory of files needed to perform the work and an example of a finished project produced by a human.

Hendrycks says that while AI models have gotten better at coding, math, and logical reasoning in recent years, they still struggle to use different tools and to perform complex tasks that involve numerous steps. “They don’t have long-term memory storage and can’t do continual learning from experiences. They can’t pick up skills on the job like humans,” he says.

The analysis offers a counterpoint to a benchmark of economic work offered in September by OpenAI called GDPval, which purports to measure economically valuable work. According to GDPval, frontier AI models such as GPT-5 are approaching human abilities on 220 tasks across a range of office jobs. OpenAI did not provide a comment.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleThe Microsoft Azure Outage Shows the Harsh Reality of Cloud Failures
Next Article Ex-L3Harris Cyber Boss Pleads Guilty to Selling Trade Secrets to Russian Firm

Related Articles

Vaping Is ‘Everywhere’ in Schools—Sparking a Bathroom Surveillance Boom
News

Vaping Is ‘Everywhere’ in Schools—Sparking a Bathroom Surveillance Boom

19 November 2025
‘Odd Lots’ Cohost Joe Weisenthal Has Predictions About How the AI Bubble Will Burst
News

‘Odd Lots’ Cohost Joe Weisenthal Has Predictions About How the AI Bubble Will Burst

18 November 2025
This Quest 3S Bundle Is  Off and Includes a Game and Gift Card
News

This Quest 3S Bundle Is $50 Off and Includes a Game and Gift Card

18 November 2025
4 Clever Tricks That Make It Worth Switching to Proton Mail
News

4 Clever Tricks That Make It Worth Switching to Proton Mail

18 November 2025
Microsoft’s Agent 365 Tries to Be the AI Bot Boss
News

Microsoft’s Agent 365 Tries to Be the AI Bot Boss

18 November 2025
Gemini 3 Is Here—and Google Says It Will Make Search Smarter
News

Gemini 3 Is Here—and Google Says It Will Make Search Smarter

18 November 2025
Demo
Top Articles
ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024107 Views
5 laptops to buy instead of the M4 MacBook Pro

5 laptops to buy instead of the M4 MacBook Pro

17 November 202496 Views
Costco partners with Electric Era to bring back EV charging in the U.S.

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 202495 Views

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Latest News
Microsoft’s Agent 365 Tries to Be the AI Bot Boss News

Microsoft’s Agent 365 Tries to Be the AI Bot Boss

News Room18 November 2025
Analogue 3D, Which Can Play Any N64 Cartridge, Continues The Company’s Stellar Reputation Gaming

Analogue 3D, Which Can Play Any N64 Cartridge, Continues The Company’s Stellar Reputation

News Room18 November 2025
Gemini 3 Is Here—and Google Says It Will Make Search Smarter News

Gemini 3 Is Here—and Google Says It Will Make Search Smarter

News Room18 November 2025
Most Popular
The Spectacular Burnout of a Solar Panel Salesman

The Spectacular Burnout of a Solar Panel Salesman

13 January 2025135 Views
ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024107 Views
5 laptops to buy instead of the M4 MacBook Pro

5 laptops to buy instead of the M4 MacBook Pro

17 November 202496 Views
Our Picks
This Quest 3S Bundle Is  Off and Includes a Game and Gift Card

This Quest 3S Bundle Is $50 Off and Includes a Game and Gift Card

18 November 2025
4 Clever Tricks That Make It Worth Switching to Proton Mail

4 Clever Tricks That Make It Worth Switching to Proton Mail

18 November 2025
Microsoft’s Agent 365 Tries to Be the AI Bot Boss

Microsoft’s Agent 365 Tries to Be the AI Bot Boss

18 November 2025

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact Us
© 2025 Best in Technology. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.