Best in Technology

Researchers Made an IQ Test for AI, Found They’re All Pretty Stupid

By News Room | 1 December 2023 | 4 Mins Read

There’s been a lot of talk lately about AGI (artificial general intelligence), the much-coveted AI development goal that every company in Silicon Valley is currently racing to achieve. AGI refers to a hypothetical point in the future when AI algorithms will be able to do most of the jobs that humans currently do. According to this theory, the emergence of AGI will bring about fundamental changes in society, ushering in a “post-work” world in which humans can sit around enjoying themselves while robots do most of the heavy lifting. If you believe the headlines, OpenAI’s recent palace intrigue may have been partially inspired by a breakthrough in AGI, the so-called “Q*” program, which sources close to the startup claim was responsible for the power struggle.

But, according to recent research from Yann LeCun, Meta’s top AI scientist, artificial intelligence isn’t going to be general-purpose anytime soon. Indeed, in a recently released paper, LeCun argues that AI is still much dumber than humans in the ways that matter most.

That paper, which was co-authored by a host of other scientists (including researchers from other AI startups, like Hugging Face and AutoGPT), looks at how AI’s general-purpose reasoning stacks up against the average human. To measure this, the research team put together its own series of questions that, as the study describes, would be “conceptually simple for humans yet challenging for most advanced AIs.” The questions were given to a sample of humans and also delivered to a plugin-equipped version of GPT-4, the latest large language model from OpenAI. The new research, which has yet to be peer-reviewed, tested AI programs for how they would respond to “real-world questions that require a set of fundamental abilities such as reasoning, multi-modality handling, web browsing, and generally tool-use proficiency.”

The questions required the LLM to work through several steps to gather the information needed for an answer. For instance, one question asked the LLM to visit a specific website and answer a question about information on that site; others required the program to run a general web search for information associated with a person in a photo.

The end result? The LLMs didn’t do very well.

Indeed, the research results show that large language models were typically outmatched by humans when it came to these more complicated real-world problem-solving scenarios. The report, which names its benchmark GAIA, notes:

In spite of being successful at tasks that are difficult for humans, the most capable LLMs do poorly on GAIA. Even equipped with tools, GPT4 does not exceed a 30% success rate for the easiest of our tasks, and 0% for the hardest. In the meantime, the average success rate for human respondents is 92%.

“We posit that the advent of Artificial General Intelligence (AGI) hinges on a system’s capability to exhibit similar robustness as the average human does on such questions,” the recent study concludes.

LeCun has diverged from other AI scientists, some of whom have spoken breathlessly about the possibility of AGI being developed in the near term. In recent tweets, the Meta scientist was highly critical of the industry’s current technology, arguing that AI is nowhere near human-level ability.

“I have argued, since at least 2016, that AI systems need to have internal models of the world that would allow them to predict the consequences of their actions, and thereby allow them to reason and plan. Current Auto-Regressive LLMs do not have this ability, nor anything close to it, and hence are nowhere near reaching human-level intelligence,” said LeCun in a recent tweet. “In fact, their complete lack of understanding of the physical world and lack of planning abilities puts them way below cat-level intelligence, never mind human-level.”
