Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Meng Chu; Xuan Billy Zhang; Kevin Qinghong Lin; Lingdong Kong; Jize Zhang; Teng Tu; Weijian Ma; Ziqi Huang; Senqiao Yang; Wei Huang; Yeying Jin; Zhefan Rao; Jinhui Ye; Xinyu Lin; Xichen Zhang; Qisheng Hu; Shuai Yang; Leyang Shen; Wei Chow; Yifei Dong; Fengyi Wu; Quanyu Long; Bin Xia; Shaozuo Yu; Mingkang Zhu; Wenhu Zhang; Jiehui Huang; Haokun Gui; Haoxuan Che; Long Chen; Qifeng Chen; Wenxuan Zhang; Wenya Wang; Xiaojuan Qi; Yang Deng; Yanwei Li; Mike Zheng Shou; Zhi-Qi Cheng; See-Kiong Ng; Ziwei Liu; Philip Torr; Jiaya Jia

arXiv:2604.22748 · cs.AI · April 27, 2026

TL;DR

This paper introduces a "levels x laws" taxonomy of world models for AI agents, organized by three capability levels (L1 Predictor, L2 Simulator, L3 Evolver) and four governing-law regimes (physical, digital, social, and scientific), and synthesizes existing research to guide future development.

Contribution

It proposes a "levels x laws" taxonomy for world models, synthesizes over 400 works, and provides a roadmap for advancing AI environment modeling.

Findings

01. Synthesized 400+ works across multiple domains.
02. Analyzed failure modes and evaluation practices.
03. Outlined architectural guidance and open problems.

Abstract

As AI systems move from generating text to accomplishing goals through sustained interaction, the ability to model environment dynamics becomes a central bottleneck. Agents that manipulate objects, navigate software, coordinate with others, or design experiments require predictive environment models, yet the term world model carries different meanings across research communities. We introduce a "levels x laws" taxonomy organized along two axes. The first defines three capability levels: L1 Predictor, which learns one-step local transition operators; L2 Simulator, which composes them into multi-step, action-conditioned rollouts that respect domain laws; and L3 Evolver, which autonomously revises its own model when predictions fail against new evidence. The second identifies four governing-law regimes: physical, digital, social, and scientific. These regimes determine what constraints a…
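The three capability levels described in the abstract can be illustrated with a toy sketch. Everything below is a hypothetical illustration, not the paper's method: the class name `ToyWorldModel`, the 1-D linear dynamics, the `bounds` clamp standing in for a "domain law", and the single-parameter refit in `revise` are all assumptions chosen to keep the example small. A real L3 Evolver would revise far more than one scalar.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ToyWorldModel:
    """Hypothetical 1-D model (assumption): next_state = state + gain * action."""

    gain: float = 1.0
    # Stand-in for a domain law (assumption): states must stay in this range.
    bounds: Tuple[float, float] = (0.0, 10.0)

    def predict(self, state: float, action: float) -> float:
        """L1 Predictor: a one-step local transition operator."""
        return state + self.gain * action

    def rollout(self, state: float, actions: List[float]) -> List[float]:
        """L2 Simulator: compose one-step predictions into a multi-step,
        action-conditioned rollout, enforcing the constraint at every step."""
        low, high = self.bounds
        trajectory = [state]
        for action in actions:
            state = min(max(self.predict(state, action), low), high)
            trajectory.append(state)
        return trajectory

    def revise(self, state: float, action: float, observed_next: float,
               tolerance: float = 1e-6) -> bool:
        """L3 Evolver (greatly simplified): when a prediction disagrees with
        new evidence, refit the model. Returns True if the model changed."""
        error = observed_next - self.predict(state, action)
        if abs(error) <= tolerance or action == 0:
            return False  # prediction held, or the evidence is uninformative
        self.gain = (observed_next - state) / action
        return True
```

For example, `ToyWorldModel().rollout(0.0, [1.0, 2.0, 20.0])` chains three one-step predictions and clamps the last one to the upper bound, while `revise(0.0, 2.0, 1.0)` updates `gain` because the observed outcome contradicts the default model.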

Code & Models

Repositories

matrix-agent/awesome-agentic-world-modeling
github
