Lyra 2.0: Explorable Generative 3D Worlds

Tianchang Shen; Sherwin Bahmani; Kai He; Sangeetha Grama Srinivasan; Tianshi Cao; Jiawei Ren; Ruilong Li; Zian Wang; Nicholas Sharp; Zan Gojcic; Sanja Fidler; Jiahui Huang; Huan Ling; Jun Gao; Xuanchi Ren

arXiv:2604.13036·cs.CV·April 15, 2026

Lyra 2.0: Explorable Generative 3D Worlds

Tianchang Shen, Sherwin Bahmani, Kai He, Sangeetha Grama Srinivasan, Tianshi Cao, Jiawei Ren, Ruilong Li, Zian Wang, Nicholas Sharp, Zan Gojcic, Sanja Fidler, Jiahui Huang, Huan Ling, Jun Gao, Xuanchi Ren

PDF

8 Models

TL;DR

Lyra 2.0 introduces a scalable framework for generating long, consistent 3D worlds by addressing spatial forgetting and temporal drifting in video-based scene creation.

Contribution

It proposes a novel method that maintains per-frame 3D geometry and trains with self-augmented histories to improve long-horizon, 3D-consistent video generation.

Findings

01

Enables longer, more consistent 3D scene trajectories.

02

Improves scene appearance and geometry fidelity over extended sequences.

03

Facilitates reliable 3D scene reconstruction from generated videos.

Abstract

Recent advances in video generation enable a new paradigm for 3D scene creation: generating camera-controlled videos that simulate scene walkthroughs, then lifting them to 3D via feed-forward reconstruction techniques. This generative reconstruction approach combines the visual fidelity and creative capacity of video models with 3D outputs ready for real-time rendering and simulation. Scaling to large, complex environments requires 3D-consistent video generation over long camera trajectories with large viewpoint changes and location revisits, a setting where current video models degrade quickly. Existing methods for long-horizon generation are fundamentally limited by two forms of degradation: spatial forgetting and temporal drifting. As exploration proceeds, previously observed regions fall outside the model's temporal context, forcing the model to hallucinate structures when…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.