Close Menu
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On

Neil Patrick Harris Stars In Newly Announced Deadpool VR

7 June 2025

Elon Musk’s Fight With Trump Threatens $48 Billion in Government Contracts

6 June 2025

iFixit Says Switch 2 Is Probably Still Drift Prone

6 June 2025
Facebook X (Twitter) Instagram
Just In
  • Neil Patrick Harris Stars In Newly Announced Deadpool VR
  • Elon Musk’s Fight With Trump Threatens $48 Billion in Government Contracts
  • iFixit Says Switch 2 Is Probably Still Drift Prone
  • Top Smartphones Under Rs. 15,000 in India (June 2025): Samsung Galaxy M16, iQOO Z10x, Infinix Note 50X, More
  • The Game Maker’s Sketchbook Event Sells Prints Of Impressive Video Game Art In The Name Of Charity
  • Cybercriminals Are Hiding Malicious Web Traffic in Plain Sight
  • Huawei Mate XT 2 Tipped to Launch in H2 2025 With Upgraded Chipset, Cameras
  • Barry Diller Invented Prestige TV. Then He Conquered the Internet
Facebook X (Twitter) Instagram Pinterest Vimeo
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release
Subscribe
Best in TechnologyBest in Technology
Home » Google DeepMind’s Chatbot-Powered Robot Is Part of a Bigger Revolution
News

Google DeepMind’s Chatbot-Powered Robot Is Part of a Bigger Revolution

News RoomBy News Room12 July 20244 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
Share
Facebook Twitter LinkedIn Pinterest Email

In a cluttered open-plan office in Mountain View, California, a tall and slender wheeled robot has been busy playing tour guide and informal office helper—thanks to a large language model upgrade, Google DeepMind revealed today. The robot uses the latest version of Google’s Gemini large language model to both parse commands and find its way around.

When told by a human “Find me somewhere to write,” for instance, the robot dutifully trundles off, leading the person to a pristine whiteboard located somewhere in the building.

Gemini’s ability to handle video and text—in addition to its capacity to ingest large amounts of information in the form of previously recorded video tours of the office—allows the “Google helper” robot to make sense of its environment and navigate correctly when given commands that require some commonsense reasoning. The robot combines Gemini with an algorithm that generates specific actions for the robot to take, such as turning, in response to commands and what it sees in front of it.

When Gemini was introduced in December, Demis Hassabis, CEO of Google DeepMind, told WIRED that its multimodal capabilities would likely unlock new robot abilities. He added that the company’s researchers were hard at work testing the robotic potential of the model.

In a new paper outlining the project, the researchers behind the work say that their robot proved to be up to 90 percent reliable at navigating, even when given tricky commands such as “Where did I leave my coaster?” DeepMind’s system “has significantly improved the naturalness of human-robot interaction, and greatly increased the robot usability,” the team writes.

Courtesy of Google DeepMind

A photo of a Google DeepMind employee interacting with an AI robot.

Photograph: Muinat Abdul; Google DeepMind

The demo neatly illustrates the potential for large language models to reach into the physical world and do useful work. Gemini and other chatbots mostly operate within the confines of a web browser or app, although they are increasingly able to handle visual and auditory input, as both Google and OpenAI have demonstrated recently. In May, Hassabis showed off an upgraded version of Gemini capable of making sense of an office layout as seen through a smartphone camera.

Academic and industry research labs are racing to see how language models might be used to enhance robots’ abilities. The May program for the International Conference on Robotics and Automation, a popular event for robotics researchers, lists almost two dozen papers that involve use of vision language models.

Investors are pouring money into startups aiming to apply advances in AI to robotics. Several of the researchers involved with the Google project have since left the company to found a startup called Physical Intelligence, which received an initial $70 million in funding; it is working to combine large language models with real-world training to give robots general problem-solving abilities. Skild AI, founded by roboticists at Carnegie Mellon University, has a similar goal. This month it announced $300 million in funding.

Just a few years ago, a robot would need a map of its environment and carefully chosen commands to navigate successfully. Large language models contain useful information about the physical world, and newer versions that are trained on images and video as well as text, known as vision language models, can answer questions that require perception. Gemini allows Google’s robot to parse visual instructions as well as spoken ones, following a sketch on a whiteboard that shows a route to a new destination.

In their paper, the researchers say they plan to test the system on different kinds of robots. They add that Gemini should be able to make sense of more complex questions, such as “Do they have my favorite drink today?” from a user with a lot of empty Coke cans on their desk.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleBest Games of 2024 (So Far) And Anger Foot Review | GI Show
Next Article The giant 77-inch Samsung S90C OLED TV has a $1,300 price cut

Related Articles

News

Elon Musk’s Fight With Trump Threatens $48 Billion in Government Contracts

6 June 2025
News

iFixit Says Switch 2 Is Probably Still Drift Prone

6 June 2025
News

Cybercriminals Are Hiding Malicious Web Traffic in Plain Sight

6 June 2025
News

Barry Diller Invented Prestige TV. Then He Conquered the Internet

6 June 2025
News

Review: Whoop MG

6 June 2025
News

DOGE Is on a Recruiting Spree

6 June 2025
Demo
Top Articles

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 202495 Views

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 202493 Views

5 laptops to buy instead of the M4 MacBook Pro

17 November 202466 Views

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Latest News
News

Cybercriminals Are Hiding Malicious Web Traffic in Plain Sight

News Room6 June 2025
Phones

Huawei Mate XT 2 Tipped to Launch in H2 2025 With Upgraded Chipset, Cameras

News Room6 June 2025
News

Barry Diller Invented Prestige TV. Then He Conquered the Internet

News Room6 June 2025
Most Popular

The Spectacular Burnout of a Solar Panel Salesman

13 January 2025123 Views

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 202495 Views

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 202493 Views
Our Picks

Top Smartphones Under Rs. 15,000 in India (June 2025): Samsung Galaxy M16, iQOO Z10x, Infinix Note 50X, More

6 June 2025

The Game Maker’s Sketchbook Event Sells Prints Of Impressive Video Game Art In The Name Of Charity

6 June 2025

Cybercriminals Are Hiding Malicious Web Traffic in Plain Sight

6 June 2025

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact Us
© 2025 Best in Technology. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.