Towards Universal Physical Attacks On Cascaded Camera-Lidar 3D Object Detection Models
Mazen Abdelfattah, Kaiwen Yuan, Z. Jane Wang, Rabab Ward

TL;DR
This paper introduces a universal physical adversarial attack on cascaded multi-modal 3D object detection models for self-driving cars, demonstrating significant vulnerability by perturbing geometry and texture in both image and point cloud domains.
Contribution
It presents the first universal physical attack on cascaded multi-modal DNNs, combining geometry and texture perturbations to reduce detection accuracy.
Findings
Achieved a 73% reduction in car detection accuracy on KITTI benchmark.
Demonstrated effectiveness of multi-modal adversarial attacks in real-world scenarios.
Provided insights into the vulnerabilities of cascaded RGB and point cloud models.
Abstract
We propose a universal and physically realizable adversarial attack on a cascaded multi-modal deep learning network (DNN), in the context of self-driving cars. DNNs have achieved high performance in 3D object detection, but they are known to be vulnerable to adversarial attacks. These attacks have been heavily investigated in the RGB image domain and more recently in the point cloud domain, but rarely in both domains simultaneously - a gap to be filled in this paper. We use a single 3D mesh and differentiable rendering to explore how perturbing the mesh's geometry and texture can reduce the robustness of DNNs to adversarial attacks. We attack a prominent cascaded multi-modal DNN, the Frustum-Pointnet model. Using the popular KITTI benchmark, we showed that the proposed universal multi-modal attack was successful in reducing the model's ability to detect a car by nearly 73%. This work…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
