DeepMind's latest AI model can help robots fold origami and close Ziploc bags

Since its debut at the end of last year, Gemini 2.0 has gone on to power a handful of Google products, including a new AI Mode chatbot. Now Google DeepMind is using that same technology for something altogether more interesting. On Wednesday, the AI lab announced two new Gemini-based models it says will "lay the foundation for a new generation of helpful robots."

The first, Gemini Robotics, was designed by Deepmind to facilitate direct control of robots. According to the company, AI systems for robots need to excel at three qualities: generality, interactivity and dexterity.

The first involves a robot's flexibility to adapt to novel situations, including ones not covered by its training. Interactivity, meanwhile, encapsulates a robot's ability to respond to people and the environment. Finally, there's dexterity, which is mostly self-explanatory: a lot of tasks humans can complete without a second thought involve fine motor skills that are difficult for robots to master.

"While our previous work demonstrated progress in these areas, Gemini Robotics represents a substantial step in performance on all three axes, getting us closer to truly general purpose robots," says DeepMind.

For instance, with Gemini Robotics powering it, DeepMind's ALOHA 2 robot is able to fold origami and close a Ziploc bag. The two-armed robot also understands all the instructions given to it in natural, everyday language. As you can see from the video Google shared, it can even complete tasks despite encountering roadblocks, such as when the researcher moves around the Tupperware he just asked the robot to place the fruit inside of.

Google is partnering with Apptronik, the company behind the Apollo bipedal robot, to build the next generation of humanoid robots. At the same time, DeepMind is releasing Gemini Robotics-ER (or embodied reasoning). Of the second model, the company says it will enable roboticists to run their own programs using Gemini's advanced reasoning abilities. DeepMind is giving "trusted testers," including one-time Google subsidiary Boston Dynamics, access to the system.

This article originally appeared on Engadget at https://www.engadget.com/ai/deepminds-latest-ai-model-can-help-robots-fold-origami-and-close-ziploc-bags-151455249.html?src=rss https://www.engadget.com/ai/deepminds-latest-ai-model-can-help-robots-fold-origami-and-close-ziploc-bags-151455249.html?src=rss
Létrehozva 6mo | 2025. márc. 12. 17:20:13


Jelentkezéshez jelentkezzen be

EGYÉB POSTS Ebben a csoportban

BioShock creator Ken Levine's Judas game still exists, now has key art

Remember Judas? No, not the biblical figure and not the

2025. aug. 27. 21:20:20 | Engadget
Microsoft Copilot is now a talking blob on Samsung TVs

Copilot, Microsoft's AI assistant that's integrated into Windows and Microsoft 365, is making the jump to your living room. The company

2025. aug. 27. 21:20:18 | Engadget
Crystal Dynamics announces layoffs, but says Tomb Raider will not be impacted

Crystal Dynamics, the studio behind the recent Tomb Raider games, announced an unspecified number of layoffs today. In a

2025. aug. 27. 21:20:17 | Engadget
The PS Plus monthly games for September include Psychonauts 2 and Stardew Valley

A new month is almost upon us, which means Sony is about to drop some fresh games that all PlayStation Plus members can keep in their collection as long as they maintain a subscription. There are s

2025. aug. 27. 18:50:55 | Engadget
Google Pixel 10 Pro and Pro XL review: Redefining the smart in smartphone

In the ‘90s, the term “smartphone” emerged to denote devices with “advanced comput

2025. aug. 27. 18:50:51 | Engadget
Google Pixel 10 review: The new smartphone standard

Google marked the tenth generation of Pixels

2025. aug. 27. 18:50:48 | Engadget
Anthropic admits its AI is being used to conduct cybercrime

Anthropic’s agentic AI, Claude, has been "weaponized" in high-level cyberattacks, according to a new

2025. aug. 27. 18:50:45 | Engadget