Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make Student Better
Bojia Zi, Shihao Zhao, Xingjun Ma, Yu-Gang Jiang

TL;DR
This paper introduces RSLAD, a novel method that uses robust soft labels from large adversarially trained models to significantly improve the robustness of small neural networks against adversarial attacks.
Contribution
The paper proposes RSLAD, a new adversarial robustness distillation technique leveraging robust soft labels to enhance small model robustness, outperforming existing methods.
Findings
RSLAD improves small model robustness against state-of-the-art attacks.
Robust soft labels are crucial for effective adversarial robustness distillation.
RSLAD outperforms existing adversarial training and distillation methods.
Abstract
Adversarial training is one effective approach for training robust deep neural networks against adversarial attacks. While being able to bring reliable robustness, adversarial training (AT) methods in general favor high capacity models, i.e., the larger the model the better the robustness. This tends to limit their effectiveness on small models, which are more preferable in scenarios where storage or computing resources are very limited (e.g., mobile devices). In this paper, we leverage the concept of knowledge distillation to improve the robustness of small models by distilling from adversarially trained large models. We first revisit several state-of-the-art AT methods from a distillation perspective and identify one common technique that can lead to improved robustness: the use of robust soft labels -- predictions of a robust model. Following this observation, we propose a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications
MethodsKnowledge Distillation
