Making Deep Heatmaps Robust to Partial Occlusions for 3D Object Pose Estimation
Markus Oberweger, Mahdi Rad, Vincent Lepetit

TL;DR
This paper presents a robust 3D object pose estimation method that uses patch-based heatmap predictions to handle large occlusions, outperforming existing approaches on challenging datasets.
Contribution
The authors introduce a patch-based heatmap prediction approach that improves robustness to occlusions in 3D pose estimation from single images.
Findings
Outperforms existing methods on Occluded LineMOD and YCB-Video datasets.
Patch-based heatmaps increase robustness to partial occlusions.
Effective handling of ambiguities in patch predictions.
Abstract
We introduce a novel method for robust and accurate 3D object pose estimation from a single color image under large occlusions. Following recent approaches, we first predict the 2D projections of 3D points related to the target object and then compute the 3D pose from these correspondences using a geometric method. Unfortunately, as the results of our experiments show, predicting these 2D projections using a regular CNN or a Convolutional Pose Machine is highly sensitive to partial occlusions, even when these methods are trained with partially occluded examples. Our solution is to predict heatmaps from multiple small patches independently and to accumulate the results to obtain accurate and robust predictions. Training subsequently becomes challenging because patches with similar appearances but different positions on the object correspond to different heatmaps. However, we provide a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
