A Divide et Impera Approach for 3D Shape Reconstruction from Multiple Views
Riccardo Spezialetti, David Joseph Tan, Alessio Tonioni, Keisuke, Tateno, Federico Tombari

TL;DR
This paper introduces a viewpoint-variant 3D shape reconstruction method from multiple images that merges visible information from different views, improving detail preservation over traditional viewpoint-invariant approaches.
Contribution
It proposes a three-step approach combining pose estimation, voxel carving, and refinement to enhance 3D reconstruction from multiple views.
Findings
Effective pose estimation and shape reconstruction on ShapeNet benchmark
Improved detail preservation from multiple viewpoints
Robustness to occlusions and view variations
Abstract
Estimating the 3D shape of an object from a single or multiple images has gained popularity thanks to the recent breakthroughs powered by deep learning. Most approaches regress the full object shape in a canonical pose, possibly extrapolating the occluded parts based on the learned priors. However, their viewpoint invariant technique often discards the unique structures visible from the input images. In contrast, this paper proposes to rely on viewpoint variant reconstructions by merging the visible information from the given views. Our approach is divided into three steps. Starting from the sparse views of the object, we first align them into a common coordinate system by estimating the relative pose between all the pairs. Then, inspired by the traditional voxel carving, we generate an occupancy grid of the object taken from the silhouette on the images and their relative poses.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
