TL;DR
LivingWorld is an interactive framework that generates 4D worlds with coherent environmental dynamics from a single image, supporting real-time scene expansion and dynamic effects like clouds and water.
Contribution
It introduces a novel globally coherent motion field construction method and a geometry-aware alignment module for interactive 4D scene generation from a single image.
Findings
Generates 4D worlds with environmental dynamics in 9 seconds per scene expansion.
Maintains global consistency and coherence in environmental motion across scene expansions.
Supports long, temporally coherent 4D sequences without expensive video refinement.
Abstract
We introduce LivingWorld, an interactive framework for generating 4D worlds with environmental dynamics from a single image. While recent advances in 3D scene generation enable large-scale environment creation, most approaches focus primarily on reconstructing static geometry, leaving scene-scale environmental dynamics such as clouds, water, or smoke largely unexplored. Modeling such dynamics is challenging because motion must remain coherent across an expanding scene while supporting low-latency user feedback. LivingWorld addresses this challenge by progressively constructing a globally coherent motion field as the scene expands. To maintain global consistency during expansion, we introduce a geometry-aware alignment module that resolves directional and scale ambiguities across views. We further represent motion using a compact hash-based motion field, enabling efficient querying and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
