Noise-Response Analysis of Deep Neural Networks Quantifies Robustness and Fingerprints Structural Malware
N. Benjamin Erichson, Dane Taylor, Qixuan Wu, Michael W. Mahoney

TL;DR
This paper introduces a rapid noise-response analysis method to detect structural malware in deep neural networks by examining their sensitivity to noise, significantly improving detection speed over existing techniques.
Contribution
The authors propose a novel noise-response fingerprinting technique that efficiently detects backdoors in DNNs by analyzing their robustness to noise, offering a faster alternative to current methods.
Findings
Backdoored DNNs are more sensitive to input noise.
The method accurately detects backdoors with high confidence.
Detection is achieved in seconds, much faster than existing approaches.
Abstract
The ubiquity of deep neural networks (DNNs), cloud-based training, and transfer learning is giving rise to a new cybersecurity frontier in which unsecure DNNs have `structural malware' (i.e., compromised weights and activation pathways). In particular, DNNs can be designed to have backdoors that allow an adversary to easily and reliably fool an image classifier by adding a pattern of pixels called a trigger. It is generally difficult to detect backdoors, and existing detection methods are computationally expensive and require extensive resources (e.g., access to the training data). Here, we propose a rapid feature-generation technique that quantifies the robustness of a DNN, `fingerprints' its nonlinearity, and allows us to detect backdoors (if present). Our approach involves studying how a DNN responds to noise-infused images with varying noise intensity, which we summarize with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
