NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models
Francesco Milano, Jen Jen Chung, Hermann Blum, Roland Siegwart, Lionel Ott

TL;DR
NeuSurfEmb introduces a CAD-model-free pipeline for 6D object pose estimation using a novel NeuS2 representation, semi-automated data generation, and a correspondence-based estimator, achieving competitive results with less setup.
Contribution
The paper presents a fully pipeline that eliminates the need for CAD models and PBR, leveraging NeuS2 and semi-automated photorealistic data synthesis for robust pose estimation.
Findings
Competitive performance on LINEMOD-Occlusion dataset.
Outperforms CAD-model-free approaches in accuracy and robustness.
Eases deployment in real-world scenarios with minimal manual setup.
Abstract
State-of-the-art approaches for 6D object pose estimation assume the availability of CAD models and require the user to manually set up physically-based rendering (PBR) pipelines for synthetic training data generation. Both factors limit the application of these methods in real-world scenarios. In this work, we present a pipeline that does not require CAD models and allows training a state-of-the-art pose estimator requiring only a small set of real images as input. Our method is based on a NeuS2 object representation, that we learn through a semi-automated procedure based on Structure-from-Motion (SfM) and object-agnostic segmentation. We exploit the novel-view synthesis ability of NeuS2 and simple cut-and-paste augmentation to automatically generate photorealistic object renderings, which we use to train the correspondence-based SurfEmb pose estimator. We evaluate our method on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning · Hand Gesture Recognition Systems
MethodsSparse Evolutionary Training
