Code2Worlds: Empowering Coding LLMs for 4D World Generation

Yi Zhang; Yunshuang Wang; Zeyu Zhang; Hao Tang

arXiv:2602.11757·cs.CV·February 13, 2026

Code2Worlds: Empowering Coding LLMs for 4D World Generation

Yi Zhang, Yunshuang Wang, Zeyu Zhang, Hao Tang

PDF

Open Access 1 Datasets

TL;DR

Code2Worlds introduces a novel framework that enables 4D world generation by combining language-based simulation code generation with physics-aware, iterative refinement, advancing beyond static scene generation to dynamic, physically consistent worlds.

Contribution

The paper presents a dual-stream architecture and a closed-loop mechanism for physics-aware 4D world generation from language, addressing multi-scale entanglement and dynamic fidelity challenges.

Findings

01

Outperforms baselines with 41% SGS gain and 49% higher richness.

02

Generates physics-aware, dynamic worlds absent in prior static methods.

03

Demonstrates effective language-to-simulation code translation for 4D environments.

Abstract

Achieving spatial intelligence requires moving beyond visual plausibility to build world simulators grounded in physical laws. While coding LLMs have advanced static 3D scene generation, extending this paradigm to 4D dynamics remains a critical frontier. This task presents two fundamental challenges: multi-scale context entanglement, where monolithic generation fails to balance local object structures with global environmental layouts; and a semantic-physical execution gap, where open-loop code generation leads to physical hallucinations lacking dynamic fidelity. We introduce Code2Worlds, a framework that formulates 4D generation as language-to-simulation code generation. First, we propose a dual-stream architecture that disentangles retrieval-augmented object generation from hierarchical environmental orchestration. Second, to ensure dynamic fidelity, we establish a physics-aware…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

AIGeeksGroup/Code4D
dataset· 143 dl
143 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · 3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis