Multi-view object pose estimation from correspondence distributions and epipolar geometry
Rasmus Laurvig Haugaard, Thorbj{\o}rn Mosekj{\ae}r Iversen

TL;DR
This paper introduces a multi-view pose estimation approach that aggregates learned correspondence distributions and epipolar geometry to significantly improve accuracy over single-view methods, especially in ambiguous scenarios.
Contribution
The proposed method uniquely combines learned 2D-3D correspondence distributions with epipolar constraints for multi-view pose estimation, handling visual ambiguities effectively.
Findings
Reduces pose estimation errors by 80-91% compared to single-view methods.
Achieves state-of-the-art results on T-LESS with four views.
Outperforms methods using more views in certain cases.
Abstract
In many automation tasks involving manipulation of rigid objects, the poses of the objects must be acquired. Vision-based pose estimation using a single RGB or RGB-D sensor is especially popular due to its broad applicability. However, single-view pose estimation is inherently limited by depth ambiguity and ambiguities imposed by various phenomena like occlusion, self-occlusion, reflections, etc. Aggregation of information from multiple views can potentially resolve these ambiguities, but the current state-of-the-art multi-view pose estimation method only uses multiple views to aggregate single-view pose estimates, and thus rely on obtaining good single-view estimates. We present a multi-view pose estimation method which aggregates learned 2D-3D distributions from multiple views for both the initial estimate and optional refinement. Our method performs probabilistic sampling of 3D-3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Advanced Neural Network Applications
