Model Extraction and Adversarial Attacks on Neural Networks using Switching Power Information
Tommy Li, Cory Merkel

TL;DR
This paper investigates how side-channel information, specifically switching power consumption, can aid in extracting neural network models and generating adversarial attacks, revealing security vulnerabilities in hardware implementations.
Contribution
It demonstrates that power consumption data improves model extraction fidelity by up to 30%, highlighting a new side-channel attack vector on neural networks.
Findings
Power data increases surrogate model accuracy by 30%.
Transferability of adversarial examples remains unaffected.
Side-channel info can aid in neural network security analysis.
Abstract
Artificial neural networks (ANNs) have gained significant popularity in the last decade for solving narrow AI problems in domains such as healthcare, transportation, and defense. As ANNs become more ubiquitous, it is imperative to understand their associated safety, security, and privacy vulnerabilities. Recently, it has been shown that ANNs are susceptible to a number of adversarial evasion attacks--inputs that cause the ANN to make high-confidence misclassifications despite being almost indistinguishable from the data used to train and test the network. This work explores to what degree finding these examples maybe aided by using side-channel information, specifically switching power consumption, of hardware implementations of ANNs. A black-box threat scenario is assumed, where an attacker has access to the ANN hardware's input, outputs, and topology, but the trained model parameters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Cardiac Arrest and Resuscitation · Security and Verification in Computing
