Google DeepMind has released a new robotics foundation model, Gemini Robotics ER 1.6, where “ER” stands for Embodied Reasoning. The model achieves state-of-the-art (SOTA) performance in visual and spatial reasoning and is already available through the Gemini API. Logan Kilpatrick, Head of Developer Relations at Google AI, announced the release on social media. (Source)
What is Embodied Reasoning?
Embodied Reasoning refers to an AI model’s ability to understand and reason about the physical world. Unlike traditional language models, embodied reasoning models must process the positions, shapes, materials, and physical interaction relationships of objects in three-dimensional space. Gemini Robotics ER 1.6 is specifically optimized for these kinds of tasks, enabling robots to understand their surroundings more accurately and make appropriate action decisions.
Core capabilities
The main advantages of Gemini Robotics ER 1.6 focus on two areas:
| Capability | Description |
| --- | --- |
| Visual reasoning | Identifies objects in images and video, understands the structure of a scene, and makes decisions accordingly |
| Spatial reasoning | Understands the relative positions, distances, and orientations of objects in three-dimensional space, supporting complex manipulation planning |
The combination of these two capabilities allows robots to handle more complex real-world tasks. For example, in a warehouse environment, robots need to identify objects of different shapes at the same time and calculate the best grasp angle and placement position—this is exactly the kind of scenario Gemini Robotics ER 1.6 excels at.
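The article does not specify the model’s output format. As an illustration only: earlier Gemini Robotics-ER releases return object locations as `[y, x]` points normalized to a 0–1000 grid, and assuming ER 1.6 keeps that convention, a robot controller would convert those points to pixel (and ultimately workspace) coordinates roughly like this minimal sketch:

```python
import json

def point_to_pixels(point, image_width, image_height):
    """Convert a [y, x] point on the 0-1000 normalized grid to pixel coordinates.

    The [y, x] ordering and 0-1000 normalization follow earlier
    Gemini Robotics-ER releases; this is an assumption for ER 1.6.
    """
    y_norm, x_norm = point
    x_px = round(x_norm / 1000 * image_width)
    y_px = round(y_norm / 1000 * image_height)
    return x_px, y_px

# Illustrative model response (not real output from the model):
response_text = '[{"point": [500, 250], "label": "box"}]'
for detection in json.loads(response_text):
    x, y = point_to_pixels(detection["point"], image_width=1280, image_height=720)
    print(detection["label"], x, y)  # -> box 320 360
```

A real pipeline would then map pixel coordinates through the camera’s calibration to obtain a 3D grasp target.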
Using the Gemini API
Unlike many past robot models that existed only as research papers, Gemini Robotics ER 1.6 is already accessible via the Gemini API. This means developers and hardware vendors can integrate the model directly into their own robotic systems without training a model from scratch.
Opening up the API also lowers the barrier to entry for robot AI development. In the past, building a robot system with visual and spatial reasoning capabilities required extensive data collection and model training. Now, developers can focus on hardware design and application scenarios, leaving the underlying reasoning to Gemini Robotics ER 1.6.
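As a sketch of what that integration might look like, the snippet below builds a `generateContent` request for the Gemini API’s REST endpoint with an inline image and a text prompt. The model identifier `gemini-robotics-er-1.6` and the prompt are assumptions for illustration; check Google’s published model list for the exact name.

```python
import base64
import json
import os
import urllib.request

MODEL = "gemini-robotics-er-1.6"  # assumed identifier, verify against Google's model list
URL = f"https://generativelanguage.googleapis.com/v1beta/models/{MODEL}:generateContent"

def build_request(image_bytes, prompt):
    """Build a generateContent request body: one inline JPEG plus a text prompt."""
    return {
        "contents": [{
            "parts": [
                {"inline_data": {
                    "mime_type": "image/jpeg",
                    "data": base64.b64encode(image_bytes).decode("ascii"),
                }},
                {"text": prompt},
            ]
        }]
    }

if __name__ == "__main__":
    body = build_request(b"...jpeg bytes...", "Point to every box on the shelf.")
    api_key = os.environ.get("GEMINI_API_KEY")
    if api_key:  # only send the request when a key is configured
        req = urllib.request.Request(
            URL,
            data=json.dumps(body).encode(),
            headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
        )
        with urllib.request.urlopen(req) as resp:
            print(json.load(resp))
```

In practice you would more likely use Google’s official `google-genai` SDK, which wraps this endpoint; the raw request is shown here only to make the payload structure explicit.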
Google’s robotics AI roadmap
Gemini Robotics ER 1.6 is Google DeepMind’s latest achievement in robotics. From the early RT-2 to the present Gemini Robotics series, Google has continued extending the capabilities of large language models into interaction with the physical world. The ER 1.6 version further improves reasoning accuracy over its predecessors, performing especially well in scenarios that require precise manipulation.
As the robotics industry enters a new growth cycle, foundation models with strong visual and spatial reasoning capabilities will become key infrastructure. To learn more about the development of the Gemini ecosystem, you can refer to the complete Gemini guide.
This article Google launches Gemini Robotics ER 1.6: SOTA robot model, strong in visual and spatial reasoning was first published on Chain News ABMedia.