Best in Technology

Gemini AI is making robots in the office far more useful

By News Room, 11 July 2024, 3 Mins Read

Lost in an unfamiliar office building, big box store, or warehouse? Just ask the nearest robot for directions.

A team of Google researchers combined the powers of natural language processing and computer vision to develop a novel means of robotic navigation as part of a new study published Wednesday.

Essentially, the team set out to teach a robot (in this case an Everyday Robot) how to navigate through an indoor space using natural language prompts and visual inputs. Robotic navigation used to require researchers not only to map out the environment ahead of time but also to provide specific physical coordinates within the space to guide the machine. Recent advances in what's known as Vision-Language Navigation have enabled users to simply give robots natural language commands, like “go to the workbench.” Google’s researchers are taking that concept a step further by incorporating multimodal capabilities, so that the robot can accept natural language and image instructions at the same time.

For example, a user in a warehouse would be able to show the robot an item and ask, “what shelf does this go on?” Leveraging the power of Gemini 1.5 Pro, the AI interprets both the spoken question and the visual information to formulate not just a response but also a navigation path to lead the user to the correct spot on the warehouse floor. The robots were also tested with commands like, “Take me to the conference room with the double doors,” “Where can I borrow some hand sanitizer,” and “I want to store something out of sight from public eyes. Where should I go?”

In an Instagram Reel demonstrating the system, a researcher activates the robot with an “OK robot” before asking to be led somewhere “he can draw.” The robot responds with “give me a minute. Thinking with Gemini …” before setting off briskly through the 9,000-square-foot DeepMind office in search of a large wall-mounted whiteboard.

To be fair, these trailblazing robots were already familiar with the office space’s layout. The team utilized a technique known as “Multimodal Instruction Navigation with demonstration Tours (MINT).” This involved the team first manually guiding the robot around the office, pointing out specific areas and features using natural language, though the same effect can be achieved by simply recording a video of the space using a smartphone. From that tour, the AI generates a topological graph, which it uses to match what its cameras are seeing with the “goal frame” from the demonstration video.
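
To make the idea of a topological graph concrete, here is a minimal Python sketch. It is not the researchers' implementation: the tour poses, the `max_dist` threshold, and the function names are all hypothetical, and a plain breadth-first search stands in for whatever planner the real system uses. It only shows how frames from a demonstration tour could become graph nodes, and how a route to a chosen goal frame could then be read off the graph.

```python
import math
from collections import deque

# Hypothetical tour data: (frame_id, x, y) poses in metres, recorded
# along a demonstration tour. These values are made up for illustration.
TOUR = [(0, 0.0, 0.0), (1, 1.0, 0.0), (2, 2.0, 0.0), (3, 2.0, 1.0), (4, 2.0, 2.0)]


def build_graph(tour, max_dist=1.5):
    """Connect any two tour frames whose poses lie within max_dist metres."""
    graph = {frame_id: [] for frame_id, _, _ in tour}
    for i, (a, ax, ay) in enumerate(tour):
        for b, bx, by in tour[i + 1:]:
            if math.hypot(ax - bx, ay - by) <= max_dist:
                graph[a].append(b)
                graph[b].append(a)
    return graph


def shortest_path(graph, start, goal):
    """Breadth-first search: fewest hops from start frame to goal frame."""
    parents = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for neighbour in graph[node]:
            if neighbour not in parents:
                parents[neighbour] = node
                queue.append(neighbour)
    return None  # goal frame is not reachable from start


graph = build_graph(TOUR)
print(shortest_path(graph, 0, 4))  # [0, 1, 3, 4]
```

In this toy layout, frame 1 links diagonally to frame 3, so the route skips frame 2. A real robot would also need to localise itself by matching its live camera view against the tour frames, which this sketch leaves out.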

Then, the team employed a hierarchical Vision-Language-Action (VLA) navigation policy, “combining the environment understanding and common sense reasoning,” to translate user requests into navigational actions.
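
The two-level split can be illustrated with a toy Python sketch. Everything here is an assumption made for illustration: the narration strings are invented, a crude word-overlap score stands in for the vision-language model, and the low level simply walks along the tour order rather than planning over a real map. The point is only the division of labour, with a high level that turns a request into a goal frame and a low level that turns the goal frame into individual moves.

```python
import re

# Hypothetical narration captured during the tour, keyed by frame index.
NARRATION = {
    0: "lobby entrance with reception desk",
    1: "kitchen with coffee machine",
    2: "conference room with double doors",
    3: "open area with large whiteboard for drawing",
}


def tokens(text):
    """Lower-case word set, used by the crude matching stand-in below."""
    return set(re.findall(r"[a-z]+", text.lower()))


def high_level_goal(instruction, narration):
    """Stand-in for the language model: pick the frame whose narration
    shares the most words with the instruction."""
    words = tokens(instruction)
    return max(narration, key=lambda fid: len(words & tokens(narration[fid])))


def low_level_step(current, goal):
    """Move one frame along the tour order toward the goal frame."""
    if current == goal:
        return current
    return current + 1 if goal > current else current - 1


def navigate(instruction, start, narration):
    """Run both levels: choose a goal frame, then step until it is reached."""
    goal = high_level_goal(instruction, narration)
    route = [start]
    while route[-1] != goal:
        route.append(low_level_step(route[-1], goal))
    return goal, route


print(navigate("Take me to the conference room with the double doors", 0, NARRATION))
# (2, [0, 1, 2])
```

Word overlap cannot handle requests that need reasoning, such as finding somewhere to draw when the narration only mentions a whiteboard. That gap is what a model with common sense reasoning is meant to close.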

The approach worked well in testing, with the robots achieving “86 percent and 90 percent end-to-end success rates on previously infeasible navigation tasks involving complex reasoning and multimodal user instructions in a large real world environment,” the researchers wrote.

However, they recognize that there is still room for improvement, pointing out that the robot cannot (yet) autonomously perform its own demonstration tour and noting that the AI’s ungainly inference time (how long it takes to formulate a response) of 10 to 30 seconds turns interacting with the system into a study in patience.










