SIMA 2: A Generalist Embodied Agent for Virtual Worlds

SIMA team: Adrian Bolton; Alexander Lerchner; Alexandra Cordell; Alexandre Moufarek; Andrew Bolt; Andrew Lampinen; Anna Mitenkova; Arne Olav Hallingstad; Bojan Vujatovic; Bonnie Li; Cong Lu; Daan Wierstra; Daniel P. Sawyer; Daniel Slater; David Reichert; Davide Vercelli; Demis Hassabis; Drew A. Hudson; Duncan Williams; Ed Hirst; Fabio Pardo; Felix Hill; Frederic Besse; Hannah Openshaw; Harris Chan; Hubert Soyer; Jane X. Wang; Jeff Clune; John Agapiou; John Reid; Joseph Marino; Junkyung Kim; Karol Gregor; Kaustubh Sridhar; Kay McKinney; Laura Kampis; Lei M. Zhang; Loic Matthey; Luyu Wang; Maria Abi Raad; Maria Loks-Thompson; Martin Engelcke; Matija Kecman; Matthew Jackson; Maxime Gazeau; Ollie Purkiss; Oscar Knagg; Peter Stys; Piermaria Mendolicchio; Raia Hadsell; Rosemary Ke; Ryan Faulkner; Sarah Chakera; Satinder Singh Baveja; Shane Legg; Sheleem Kashem; Tayfun Terzi; Thomas Keck; Tim Harley; Tim Scholtes; Tyson Roberts; Volodymyr Mnih; Yulan Liu; Zhengdong Wang; Zoubin Ghahramani

arXiv:2512.04797·cs.AI·December 5, 2025

SIMA 2: A Generalist Embodied Agent for Virtual Worlds

SIMA team: Adrian Bolton, Alexander Lerchner, Alexandra Cordell, Alexandre Moufarek, Andrew Bolt, Andrew Lampinen, Anna Mitenkova, Arne Olav Hallingstad, Bojan Vujatovic, Bonnie Li, Cong Lu, Daan Wierstra, Daniel P. Sawyer, Daniel Slater, David Reichert, Davide Vercelli

PDF

Open Access

TL;DR

SIMA 2 is a versatile embodied agent capable of understanding and acting in diverse 3D virtual worlds, demonstrating advanced reasoning, interaction, and autonomous learning capabilities, marking progress toward adaptable agents in virtual and real environments.

Contribution

Introduces SIMA 2, a generalist embodied agent built on Gemini, capable of complex interactions, reasoning, and autonomous skill acquisition in virtual worlds, surpassing prior simple command-based models.

Findings

01

SIMA 2 approaches human performance in diverse games.

02

The agent generalizes well to unseen environments.

03

It can autonomously learn new skills through self-generated tasks.

Abstract

We introduce SIMA 2, a generalist embodied agent that understands and acts in a wide variety of 3D virtual worlds. Built upon a Gemini foundation model, SIMA 2 represents a significant step toward active, goal-directed interaction within an embodied environment. Unlike prior work (e.g., SIMA 1) limited to simple language commands, SIMA 2 acts as an interactive partner, capable of reasoning about high-level goals, conversing with the user, and handling complex instructions given through language and images. Across a diverse portfolio of games, SIMA 2 substantially closes the gap with human performance and demonstrates robust generalization to previously unseen environments, all while retaining the base model's core reasoning capabilities. Furthermore, we demonstrate a capacity for open-ended self-improvement: by leveraging Gemini to generate tasks and provide rewards, SIMA 2 can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Social Robot Interaction and HRI · Reinforcement Learning in Robotics