Few-View Object Reconstruction with Unknown Categories and Camera Poses

Hanwen Jiang; Zhenyu Jiang; Kristen Grauman; Yuke Zhu

arXiv:2212.04492 · cs.CV · January 29, 2024 · 5 cites

TL;DR

This paper introduces FORGE, a unified method that reconstructs general real-world 3D objects from a few images without known camera poses or object categories by solving shape reconstruction and pose estimation together.

Contribution

The paper presents a novel unified approach that jointly estimates camera poses and reconstructs 3D shapes from limited views without prior category knowledge.

Findings

01. Reliable reconstruction from five views.

02. Outperforms existing pose estimation methods.

03. Comparable results using predicted and ground-truth poses.

Abstract

While object reconstruction has made great strides in recent years, current methods typically require densely captured images and/or known camera poses, and generalize poorly to novel object categories. To step toward object reconstruction in the wild, this work explores reconstructing general real-world objects from a few images without known camera poses or object categories. The crux of our work is solving two fundamental 3D vision problems -- shape reconstruction and pose estimation -- in a unified approach. Our approach captures the synergies of these two problems: reliable camera pose estimation gives rise to accurate shape reconstruction, and the accurate reconstruction, in turn, induces robust correspondence between different views and facilitates pose estimation. Our method FORGE predicts 3D features from each view and leverages them in conjunction with the input images to…

Code & Models

Repositories

ut-austin-rpl/forge (PyTorch)

Taxonomy

Topics: Advanced Vision and Imaging · 3D Surveying and Cultural Heritage · Robotics and Sensor-Based Localization