3D-Fixer: Coarse-to-Fine In-place Completion for 3D Scenes from a Single Image
Ze-Xin Yin, Liu Liu, Xinjie Wang, Wei Sui, Zhizhong Su, Jian Yang, Jin Xie

TL;DR
3D-Fixer introduces a coarse-to-fine in-place completion method for 3D scene generation from a single image, improving accuracy and efficiency by leveraging a novel dataset and a dual-branch network.
Contribution
It proposes a new in-place completion paradigm with a coarse-to-fine scheme and a dual-branch network, addressing boundary ambiguity and data scarcity in 3D scene reconstruction.
Findings
Achieves state-of-the-art geometric accuracy in 3D scene completion.
Outperforms baselines like MIDI and Gen3DSR in experiments.
Maintains efficiency comparable to diffusion-based methods.
Abstract
Compositional 3D scene generation from a single view requires the simultaneous recovery of scene layout and 3D assets. Existing approaches mainly fall into two categories: feed-forward generation methods and per-instance generation methods. The former directly predict 3D assets with explicit 6DoF poses through efficient network inference, but they generalize poorly to complex scenes. The latter improve generalization through a divide-and-conquer strategy, but suffer from time-consuming pose optimization. To bridge this gap, we introduce 3D-Fixer, a novel in-place completion paradigm. Specifically, 3D-Fixer extends 3D object generative priors to generate complete 3D assets conditioned on the partially visible point cloud at the original locations, which are cropped from the fragmented geometry obtained from the geometry estimation methods. Unlike prior works that require explicit pose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
