TL;DR
Pix2Pose is a novel RGB-based 6D object pose estimation method that predicts pixel-wise 3D coordinates, effectively handling occlusions and symmetries without textured 3D models, outperforming existing approaches.
Contribution
The paper introduces Pix2Pose, a new approach that predicts pixel-wise 3D coordinates for objects, incorporating a transformer loss for symmetry and using generative adversarial training for occlusion robustness.
Findings
Outperforms state-of-the-art methods on benchmark datasets
Effectively handles occlusion and symmetry issues
Operates solely on RGB images without textured models
Abstract
Estimating the 6D pose of objects using only RGB images remains challenging because of problems such as occlusion and symmetries. It is also difficult to construct 3D models with precise texture without expert knowledge or specialized scanning devices. To address these problems, we propose a novel pose estimation method, Pix2Pose, that predicts the 3D coordinates of each object pixel without textured models. An auto-encoder architecture is designed to estimate the 3D coordinates and expected errors per pixel. These pixel-wise predictions are then used in multiple stages to form 2D-3D correspondences to directly compute poses with the PnP algorithm with RANSAC iterations. Our method is robust to occlusion by leveraging recent achievements in generative adversarial training to precisely recover occluded parts. Furthermore, a novel loss function, the transformer loss, is proposed to handle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Softmax
