Adversarial samples for deep monocular 6D object pose estimation
Jinlai Zhang, Weiming Li, Shuang Liang, Hao Wang, Jihong Zhu

TL;DR
This paper introduces U6DA, a novel adversarial attack method that can fool state-of-the-art deep learning models in 6D object pose estimation from RGB images, highlighting robustness issues.
Contribution
The work presents the first study of adversarial samples for 6D pose estimation, proposing U6DA to attack multiple models and introducing a new robustness dataset.
Findings
U6DA effectively fools 6D pose models with imperceptible perturbations.
Adversarial samples transfer across different models and defenses.
The method reveals significant robustness vulnerabilities in current 6D pose estimation models.
Abstract
Estimating 6D object pose from an RGB image is important for many real-world applications such as autonomous driving and robotic grasping. Recent deep learning models have achieved significant progress on this task but their robustness received little research attention. In this work, for the first time, we study adversarial samples that can fool deep learning models with imperceptible perturbations to input image. In particular, we propose a Unified 6D pose estimation Attack, namely U6DA, which can successfully attack several state-of-the-art (SOTA) deep learning models for 6D pose estimation. The key idea of our U6DA is to fool the models to predict wrong results for object instance localization and shape that are essential for correct 6D pose estimation. Specifically, we explore a transfer-based black-box attack to 6D pose estimation. We design the U6DA loss to guide the generation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Robot Manipulation and Learning
