Attacking Binarized Neural Networks
Angus Galloway, Graham W. Taylor, Medhat Moussa

TL;DR
This paper investigates the robustness of binarized neural networks, showing that very low-precision models can improve resistance to adversarial attacks, but also highlighting potential pitfalls like gradient masking.
Contribution
It demonstrates that binarized neural networks can enhance adversarial robustness and clarifies the limitations of gradient masking in such models.
Findings
Stochastic quantization of weights reduces attack effectiveness
Binary neural networks can match full-precision robustness in worst cases
Gradient masking is a false security in binary models
Abstract
Neural networks with low-precision weights and activations offer compelling efficiency advantages over their full-precision equivalents. The two most frequently discussed benefits of quantization are reduced memory consumption, and a faster forward pass when implemented with efficient bitwise operations. We propose a third benefit of very low-precision neural networks: improved robustness against some adversarial attacks, and in the worst case, performance that is on par with full-precision models. We focus on the very low-precision case where weights and activations are both quantized to 1, and note that stochastically quantizing weights in just one layer can sharply reduce the impact of iterative attacks. We observe that non-scaled binary neural networks exhibit a similar effect to the original defensive distillation procedure that led to gradient masking, and a false notion of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
