Reconstruction by Generation: 3D Multi-Object Scene Reconstruction from Sparse Observations

Andrii Zadaianchuk; Leonardo Barcellona; Lennard Schuenemann; Christian Gumbsch; Zehao Wang; Muhammad Zubair Irshad; Fabien Despinoy; Rahaf Aljundi; Stratis Gavves; Sergey Zakharov

arXiv:2604.27106·cs.CV·May 1, 2026

Reconstruction by Generation: 3D Multi-Object Scene Reconstruction from Sparse Observations

Andrii Zadaianchuk, Leonardo Barcellona, Lennard Schuenemann, Christian Gumbsch, Zehao Wang, Muhammad Zubair Irshad, Fabien Despinoy, Rahaf Aljundi, Stratis Gavves, Sergey Zakharov

PDF

TL;DR

RecGen is a generative framework that reconstructs complex multi-object scenes from sparse RGB-D observations, achieving state-of-the-art accuracy with fewer training data.

Contribution

It introduces a probabilistic joint estimation method leveraging compositional scene generation and strong 3D priors, generalizing across diverse objects and environments.

Findings

01

Outperforms previous methods by 30.1% in shape quality

02

Achieves 9.1% improvement in texture reconstruction

03

Improves pose estimation accuracy by 33.9%

Abstract

Accurately reconstructing complex full multi-object scenes from sparse observations remains a core challenge in computer vision and a key step toward scalable and reliable simulation for robotics. In this work, we introduce RecGen, a generative framework for probabilistic joint estimation of object and part shapes, as well as their pose under occlusion and partial visibility from one or multiple RGB-D images. By leveraging compositional synthetic scene generation and strong 3D shape priors, RecGen generalizes across diverse object types and real-world environments. RecGen achieves state-of-the-art performance on complex, heavily occluded datasets, robustly handling severe occlusions, symmetric objects, object parts, and intricate geometry and texture. Despite using nearly 80% fewer training meshes than the previous state of the art SAM3D, RecGen outperforms it by 30.1% in geometric…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.