A Step Toward World Models: A Survey on Robotic Manipulation
Peng-Fei Zhang, Ying Cheng, Xiaofan Sun, Shijie Wang, Fengling Li, Lei Zhu, Heng Tao Shen

TL;DR
This survey reviews approaches in robotic manipulation that develop internal world models, analyzing their architectures, capabilities, and challenges to motivate progress toward generalizable robotic understanding.
Contribution
It provides a comprehensive analysis of existing methods exhibiting world model capabilities in robotics, clarifying their roles, components, and challenges to guide future development.
Findings
Identifies key components and functions of effective world models.
Highlights challenges in perception, prediction, and control integration.
Suggests directions for developing generalizable and practical world models.
Abstract
Autonomous agents are increasingly expected to operate in complex, dynamic, and uncertain environments, performing tasks such as manipulation, navigation, and decision-making. Achieving these capabilities requires agents to understand the underlying mechanisms and dynamics of the world, moving beyond reactive control or simple replication of observed states. This motivates the development of world models as internal representations that encode environmental states, capture dynamics, and support prediction, planning, and reasoning. Despite growing interest, the definition, scope, architectures, and essential capabilities of world models remain ambiguous. In this survey, we go beyond prescribing a fixed definition and limiting our scope to methods explicitly labeled as world models. Instead, we examine approaches that exhibit the core capabilities of world models through a review of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms
