Towards Universal Physical Attacks On Cascaded Camera-Lidar 3D Object   Detection Models

Mazen Abdelfattah; Kaiwen Yuan; Z. Jane Wang; Rabab Ward

arXiv:2101.10747·cs.CV·September 30, 2021

Towards Universal Physical Attacks On Cascaded Camera-Lidar 3D Object Detection Models

Mazen Abdelfattah, Kaiwen Yuan, Z. Jane Wang, Rabab Ward

PDF

TL;DR

This paper introduces a universal physical adversarial attack on cascaded multi-modal 3D object detection models for self-driving cars, demonstrating significant vulnerability by perturbing geometry and texture in both image and point cloud domains.

Contribution

It presents the first universal physical attack on cascaded multi-modal DNNs, combining geometry and texture perturbations to reduce detection accuracy.

Findings

01

Achieved a 73% reduction in car detection accuracy on KITTI benchmark.

02

Demonstrated effectiveness of multi-modal adversarial attacks in real-world scenarios.

03

Provided insights into the vulnerabilities of cascaded RGB and point cloud models.

Abstract

We propose a universal and physically realizable adversarial attack on a cascaded multi-modal deep learning network (DNN), in the context of self-driving cars. DNNs have achieved high performance in 3D object detection, but they are known to be vulnerable to adversarial attacks. These attacks have been heavily investigated in the RGB image domain and more recently in the point cloud domain, but rarely in both domains simultaneously - a gap to be filled in this paper. We use a single 3D mesh and differentiable rendering to explore how perturbing the mesh's geometry and texture can reduce the robustness of DNNs to adversarial attacks. We attack a prominent cascaded multi-modal DNN, the Frustum-Pointnet model. Using the popular KITTI benchmark, we showed that the proposed universal multi-modal attack was successful in reducing the model's ability to detect a car by nearly 73%. This work…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.