GenRC: Generative 3D Room Completion from Sparse Image Collections
Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen, Chi Liu, Yu-Lun Liu, Albert, Y. C. Chen, Cheng-Hao Kuo, Min Sun

TL;DR
GenRC is an automated, training-free pipeline that completes 3D room-scale meshes with high-fidelity textures by generating view-consistent panoramic RGBD images, ensuring global consistency without human-designed prompts.
Contribution
It introduces E-Diffusion for view-consistent image generation and uses textual inversion to maintain scene style, outperforming existing methods without dataset-specific training.
Findings
Outperforms state-of-the-art methods on ScanNet and ARKitScenes datasets.
Does not require training on target datasets or predefined camera trajectories.
Ensures global geometry and appearance consistency in 3D scene completion.
Abstract
Sparse RGBD scene completion is a challenging task especially when considering consistent textures and geometries throughout the entire scene. Different from existing solutions that rely on human-designed text prompts or predefined camera trajectories, we propose GenRC, an automated training-free pipeline to complete a room-scale 3D mesh with high-fidelity textures. To achieve this, we first project the sparse RGBD images to a highly incomplete 3D mesh. Instead of iteratively generating novel views to fill in the void, we utilized our proposed E-Diffusion to generate a view-consistent panoramic RGBD image which ensures global geometry and appearance consistency. Furthermore, we maintain the input-output scene stylistic consistency through textual inversion to replace human-designed text prompts. To bridge the domain gap among datasets, E-Diffusion leverages models trained on large-scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · 3D Shape Modeling and Analysis · Image Retrieval and Classification Techniques
