ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment

Chong Xia; Shengjun Zhang; Fangfu Liu; Chang Liu; Khodchaphun Hirunyaratsameewong; Yueqi Duan

arXiv:2507.19058·cs.CV·July 28, 2025

ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment

Chong Xia, Shengjun Zhang, Fangfu Liu, Chang Liu, Khodchaphun Hirunyaratsameewong, Yueqi Duan

PDF

TL;DR

ScenePainter is a novel framework for perpetual 3D scene generation that maintains semantic consistency across long sequences by aligning scene concepts and dynamically refining relations, improving coherence and diversity.

Contribution

It introduces SceneConceptGraph for relation modeling and alignment, addressing semantic drift in long-range 3D scene synthesis, which is a significant advancement over existing methods.

Findings

01

Overcomes semantic drift in 3D scene generation

02

Produces more consistent and immersive 3D view sequences

03

Enhances diversity through dynamic relation refinement

Abstract

Perpetual 3D scene generation aims to produce long-range and coherent 3D view sequences, which is applicable for long-term video synthesis and 3D scene reconstruction. Existing methods follow a "navigate-and-imagine" fashion and rely on outpainting for successive view expansion. However, the generated view sequences suffer from semantic drift issue derived from the accumulated deviation of the outpainting module. To tackle this challenge, we propose ScenePainter, a new framework for semantically consistent 3D scene generation, which aligns the outpainter's scene-specific prior with the comprehension of the current scene. To be specific, we introduce a hierarchical graph structure dubbed SceneConceptGraph to construct relations among multi-level scene concepts, which directs the outpainter for consistent novel views and can be dynamically refined to enhance diversity. Extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.