Web World Models
Jichen Feng, Yifan Zhang, Chenggong Zhang, Yifu Lu, Shilong Liu, Mengdi Wang

TL;DR
The paper introduces Web World Models (WWMs), a hybrid approach combining web code for logical consistency with large language models for high-level generation, enabling scalable, controllable, and open-ended virtual worlds.
Contribution
It proposes a novel framework that integrates web programming with language models to create structured, scalable virtual environments with practical design principles.
Findings
WWMs enable structured yet open-ended worlds.
Web stacks can serve as scalable substrates for world models.
Design principles improve controllability and diversity of environments.
Abstract
Language agents increasingly require persistent worlds in which they can act, remember, and learn. Existing approaches sit at two extremes: conventional web frameworks provide reliable but fixed contexts backed by databases, while fully generative world models aim for unlimited environments at the expense of controllability and practical engineering. In this work, we introduce the Web World Model (WWM), a middle ground where world state and ``physics'' are implemented in ordinary web code to ensure logical consistency, while large language models generate context, narratives, and high-level decisions on top of this structured latent state. We build a suite of WWMs on a realistic web stack, including an infinite travel atlas grounded in real geography, fictional galaxy explorers, web-scale encyclopedic and narrative worlds, and simulation- and game-like environments. Across these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Geographic Information Systems Studies · Data Visualization and Analytics
