Overload: Latency Attacks on Object Detection for Edge Devices
Erh-Chung Chen, Pin-Yu Chen, I-Hsin Chung, Che-rung Lee

TL;DR
This paper introduces Overload, a framework for latency attacks on object detection models that significantly increase inference time, posing a new threat to edge devices with limited resources.
Contribution
We propose a novel latency attack framework using spatial attention and optimization, demonstrating its effectiveness on YOLOv5 and highlighting its threat to real-time object detection.
Findings
Inference time increased tenfold with attacks
Attack is more effective and simpler than existing methods
Threat extends to all NMS-based object detection systems
Abstract
Nowadays, the deployment of deep learning-based applications is an essential task owing to the increasing demands on intelligent services. In this paper, we investigate latency attacks on deep learning applications. Unlike common adversarial attacks for misclassification, the goal of latency attacks is to increase the inference time, which may stop applications from responding to the requests within a reasonable time. This kind of attack is ubiquitous for various applications, and we use object detection to demonstrate how such kind of attacks work. We also design a framework named Overload to generate latency attacks at scale. Our method is based on a newly formulated optimization problem and a novel technique, called spatial attention. This attack serves to escalate the required computing costs during the inference time, consequently leading to an extended inference time for object…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Memory and Neural Computing · Advanced Neural Network Applications
