GEN3D: Generating Domain-Free 3D Scenes from a Single Image

Yuxin Zhang; Ziyu Lu; Hongbo Duan; Keyu Fan; Pengting Luo; Peiyu Zhuang; Mengyu Yang; Houde Liu

arXiv:2511.14291·cs.CV·November 19, 2025

GEN3D: Generating Domain-Free 3D Scenes from a Single Image

Yuxin Zhang, Ziyu Lu, Hongbo Duan, Keyu Fan, Pengting Luo, Peiyu Zhuang, Mengyu Yang, Houde Liu

PDF

Open Access

TL;DR

Gen3d is a novel method that generates high-quality, diverse 3D scenes from a single image, overcoming the limitations of multi-view dependence and advancing applications in embodied AI and world modeling.

Contribution

It introduces a new approach for single-image 3D scene generation that maintains and expands a world model, utilizing Gaussian splatting for high-fidelity scene synthesis.

Findings

01

Strong generalization across diverse datasets

02

Superior performance in novel view synthesis

03

Effective 3D scene generation from minimal input

Abstract

Despite recent advancements in neural 3D reconstruction, the dependence on dense multi-view captures restricts their broader applicability. Additionally, 3D scene generation is vital for advancing embodied AI and world models, which depend on diverse, high-quality scenes for learning and evaluation. In this work, we propose Gen3d, a novel method for generation of high-quality, wide-scope, and generic 3D scenes from a single image. After the initial point cloud is created by lifting the RGBD image, Gen3d maintains and expands its world model. The 3D scene is finalized through optimizing a Gaussian splatting representation. Extensive experiments on diverse datasets demonstrate the strong generalization capability and superior performance of our method in generating a world model and Synthesizing high-fidelity and consistent novel views.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging