Code2Worlds: Empowering Coding LLMs for 4D World Generation
Yi Zhang, Yunshuang Wang, Zeyu Zhang, Hao Tang

TL;DR
Code2Worlds introduces a novel framework that enables 4D world generation by combining language-based simulation code generation with physics-aware, iterative refinement, advancing beyond static scene generation to dynamic, physically consistent worlds.
Contribution
The paper presents a dual-stream architecture and a closed-loop mechanism for physics-aware 4D world generation from language, addressing multi-scale entanglement and dynamic fidelity challenges.
Findings
Outperforms baselines with 41% SGS gain and 49% higher richness.
Generates physics-aware, dynamic worlds absent in prior static methods.
Demonstrates effective language-to-simulation code translation for 4D environments.
Abstract
Achieving spatial intelligence requires moving beyond visual plausibility to build world simulators grounded in physical laws. While coding LLMs have advanced static 3D scene generation, extending this paradigm to 4D dynamics remains a critical frontier. This task presents two fundamental challenges: multi-scale context entanglement, where monolithic generation fails to balance local object structures with global environmental layouts; and a semantic-physical execution gap, where open-loop code generation leads to physical hallucinations lacking dynamic fidelity. We introduce Code2Worlds, a framework that formulates 4D generation as language-to-simulation code generation. First, we propose a dual-stream architecture that disentangles retrieval-augmented object generation from hierarchical environmental orchestration. Second, to ensure dynamic fidelity, we establish a physics-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · 3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis
