Occlusion-Robust Object Pose Estimation with Holistic Representation
Bo Chen, Tat-Jun Chin, Marius Klimavicius

TL;DR
This paper introduces a robust object pose estimation method that effectively handles occlusions by using novel data augmentation and holistic feature learning, outperforming existing approaches without requiring post-processing.
Contribution
It proposes a new occlusion-robust training technique and a holistic pose representation architecture that improve accuracy and coherence in object pose estimation under occlusions.
Findings
Outperforms state-of-the-art methods on LINEMOD dataset
Achieves superior results on YCB-Video dataset without refinement
Demonstrates high data efficiency in pose estimation
Abstract
Practical object pose estimation demands robustness against occlusions to the target object. State-of-the-art (SOTA) object pose estimators take a two-stage approach, where the first stage predicts 2D landmarks using a deep network and the second stage solves for 6DOF pose from 2D-3D correspondences. Albeit widely adopted, such two-stage approaches could suffer from novel occlusions when generalising and weak landmark coherence due to disrupted features. To address these issues, we develop a novel occlude-and-blackout batch augmentation technique to learn occlusion-robust deep features, and a multi-precision supervision architecture to encourage holistic pose representation learning for accurate and coherent landmark predictions. We perform careful ablation tests to verify the impact of our innovations and compare our method to SOTA pose estimators. Without the need of any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Occlusion-Robust Object Pose Estimation with Holistic Representation· youtube
Taxonomy
TopicsRobot Manipulation and Learning · Anatomy and Medical Technology · 3D Shape Modeling and Analysis
