Multimodal Adversarial Quality Policy for Safe Grasping
Kunlin Xie, Chenghao Li, Haolan Zhang, Nak Young Chong

TL;DR
This paper introduces MAQP, a multimodal adversarial policy that enhances the safety of vision-guided robot grasping by effectively managing RGB and depth data through novel optimization and balancing strategies.
Contribution
The work presents a novel multimodal adversarial framework with dual-patch optimization and gradient balancing strategies for safer robot grasping in RGBD environments.
Findings
MAQP improves safety in vision-guided grasping tasks.
Experimental results show enhanced robustness against adversarial attacks.
Framework achieves superior performance on benchmark datasets and real cobots.
Abstract
Vision-guided robot grasping based on Deep Neural Networks (DNNs) generalizes well but poses safety risks in the Human-Robot Interaction (HRI). Recent works solved it by designing benign adversarial attacks and patches with RGB modality, yet depth-independent characteristics limit their effectiveness on RGBD modality. In this work, we propose the Multimodal Adversarial Quality Policy (MAQP) to realize multimodal safe grasping. Our framework introduces two key components. First, the Heterogeneous Dual-Patch Optimization Scheme (HDPOS) mitigates the distribution discrepancy between RGB and depth modalities in patch generation by adopting modality-specific initialization strategies, employing a Gaussian distribution for depth patches and a uniform distribution for RGB patches, while jointly optimizing both modalities under a unified objective function. Second, the Gradient-Level Modality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Adversarial Robustness in Machine Learning · Social Robot Interaction and HRI
