Google DeepMind has launched Gemini Robotics 1.5, a new vision-language-action (VLA) model designed to help robots perform complex, multi-step tasks with greater autonomy and transparency.
The release includes two complementary models: Gemini Robotics 1.5 and Gemini Robotics-ER 1.5. The former is DeepMind’s most advanced VLA system to date, capable of turning visual input and instructions into motor commands.
Unlike previous generations, it generates reasoning steps before acting, allowing robots to explain their decision-making processes and adapt more effectively to new environments.