Close Menu
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On

Is this AI video generator the future of anime? Definitely not

10 May 2025

How to Use Your iPad as a Second Monitor With Your Mac

10 May 2025

Here’s how to watch Sony’s Xperia 1 VII launch event

10 May 2025
Facebook X (Twitter) Instagram
Just In
  • Is this AI video generator the future of anime? Definitely not
  • How to Use Your iPad as a Second Monitor With Your Mac
  • Here’s how to watch Sony’s Xperia 1 VII launch event
  • Dismantling NOAA Threatens the World’s Ability to Monitor Carbon Dioxide Levels
  • Sony Xperia 1 VII Design, Colour Options Spotted in Leaked Renders; Sony WH-1000XM6 to Debut on May 15
  • Samsung Galaxy S25 FE might favor power over price
  • LG G5 vs. LG C5 – is the cheaper option good enough?
  • Garmin Vivoactive 6 review: Still my favorite fitness watch
Facebook X (Twitter) Instagram Pinterest Vimeo
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release
Subscribe
Best in TechnologyBest in Technology
Home » Project Astra Is Google’s ‘Multimodal’ Answer to the New ChatGPT
News

Project Astra Is Google’s ‘Multimodal’ Answer to the New ChatGPT

News RoomBy News Room15 May 20243 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
Share
Facebook Twitter LinkedIn Pinterest Email

Pulkit Agrawal, an assistant professor at MIT who works on AI and robotics, says Google’s and OpenAI’s latest demos are impressive and show how rapidly multimodal AI models have advanced. OpenAI launched GPT-4V, a system capable of parsing images in September 2023. He was impressed that Gemini is able to make sense of live video—for example, correctly interpreting changes made to a diagram on a whiteboard in real time. OpenAI’s new version of ChatGPT appears capable of the same.

Agrawal says the assistants demoed by Google and OpenAI could provide new training data for the companies as users interact with the models in the real world. “But they have to be useful,” he adds. “The big question is what will people use them for—it’s not very clear.”

Google says Project Astra will be made available through a new interface called Gemini Live later this year. Hassabis said that the company is still testing several prototype smart glasses and has yet to make a decision on whether to launch any of them.

Astra’s capabilities might provide Google a chance to reboot a version of its ill-fated Glass smart glasses, although efforts to build hardware suited to generative AI have stumbled so far. Despite OpenAI and Google’s impressive demos, multimodal modals cannot fully understand the physical world and objects within it, placing limitations on what they will be able to do.

“Being able to build a mental model of the physical world around you is absolutely essential to building more humanlike intelligence,” says Brenden Lake, an associate professor at New York University who uses AI to explore human intelligence.

Lake notes that today’s best AI models are still very language-centric because the bulk of their learning comes from text slurped from books and the web. This is fundamentally different from how language is learned by humans, who pick it up while interacting with the physical world. “It’s backwards compared to child development,” he says of the process of creating multimodal models.

Hassabis believes that imbuing AI models with a deeper understanding of the physical world will be key to further progress in AI, and to making systems like Project Astra more robust. Other frontiers of AI, including Google DeepMind’s work on game-playing AI programs could help, he says. Hassabis and others hope such work could be revolutionary for robotics, an area that Google is also investing in.

“A multimodal universal agent assistant is on the sort of track to artificial general intelligence,” Hassabis said in reference to a hoped-for but largely undefined future point where machines can do anything and everything that a human mind can. “This is not AGI or anything, but it’s the beginning of something.”

Updated 5-14-2024, 4:15 pm EDT: This article has been updated to clarify the full name of Google’s project.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleTecno Camon 30 Series to 2 Major Android Update, 3 Years of Security Upgrades
Next Article Air cooling vs. liquid cooling: Which is best for your PC in 2024?

Related Articles

News

Is this AI video generator the future of anime? Definitely not

10 May 2025
News

How to Use Your iPad as a Second Monitor With Your Mac

10 May 2025
News

Here’s how to watch Sony’s Xperia 1 VII launch event

10 May 2025
News

Dismantling NOAA Threatens the World’s Ability to Monitor Carbon Dioxide Levels

10 May 2025
News

Samsung Galaxy S25 FE might favor power over price

10 May 2025
News

LG G5 vs. LG C5 – is the cheaper option good enough?

10 May 2025
Demo
Top Articles

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 202493 Views

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 202482 Views

5 laptops to buy instead of the M4 MacBook Pro

17 November 202457 Views

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Latest News
News

Samsung Galaxy S25 FE might favor power over price

News Room10 May 2025
News

LG G5 vs. LG C5 – is the cheaper option good enough?

News Room10 May 2025
News

Garmin Vivoactive 6 review: Still my favorite fitness watch

News Room10 May 2025
Most Popular

The Spectacular Burnout of a Solar Panel Salesman

13 January 2025118 Views

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 202493 Views

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 202482 Views
Our Picks

Dismantling NOAA Threatens the World’s Ability to Monitor Carbon Dioxide Levels

10 May 2025

Sony Xperia 1 VII Design, Colour Options Spotted in Leaked Renders; Sony WH-1000XM6 to Debut on May 15

10 May 2025

Samsung Galaxy S25 FE might favor power over price

10 May 2025

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact Us
© 2025 Best in Technology. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.