GenRC: Generative 3D Room Completion from Sparse Image Collections

Ming-Feng Li; Yueh-Feng Ku; Hong-Xuan Yen; Chi Liu; Yu-Lun Liu; Albert; Y. C. Chen; Cheng-Hao Kuo; Min Sun

arXiv:2407.12939·cs.CV·August 5, 2024·1 cites

GenRC: Generative 3D Room Completion from Sparse Image Collections

Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen, Chi Liu, Yu-Lun Liu, Albert, Y. C. Chen, Cheng-Hao Kuo, Min Sun

PDF

Open Access 1 Repo

TL;DR

GenRC is an automated, training-free pipeline that completes 3D room-scale meshes with high-fidelity textures by generating view-consistent panoramic RGBD images, ensuring global consistency without human-designed prompts.

Contribution

It introduces E-Diffusion for view-consistent image generation and uses textual inversion to maintain scene style, outperforming existing methods without dataset-specific training.

Findings

01

Outperforms state-of-the-art methods on ScanNet and ARKitScenes datasets.

02

Does not require training on target datasets or predefined camera trajectories.

03

Ensures global geometry and appearance consistency in 3D scene completion.

Abstract

Sparse RGBD scene completion is a challenging task especially when considering consistent textures and geometries throughout the entire scene. Different from existing solutions that rely on human-designed text prompts or predefined camera trajectories, we propose GenRC, an automated training-free pipeline to complete a room-scale 3D mesh with high-fidelity textures. To achieve this, we first project the sparse RGBD images to a highly incomplete 3D mesh. Instead of iteratively generating novel views to fill in the void, we utilized our proposed E-Diffusion to generate a view-consistent panoramic RGBD image which ensures global geometry and appearance consistency. Furthermore, we maintain the input-output scene stylistic consistency through textual inversion to replace human-designed text prompts. To bridge the domain gap among datasets, E-Diffusion leverages models trained on large-scale…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

minfenli/GenRC
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · 3D Shape Modeling and Analysis · Image Retrieval and Classification Techniques