Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make   Student Better

Bojia Zi; Shihao Zhao; Xingjun Ma; Yu-Gang Jiang

arXiv:2108.07969·cs.CR·August 19, 2021·5 cites

Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make Student Better

Bojia Zi, Shihao Zhao, Xingjun Ma, Yu-Gang Jiang

PDF

Open Access 1 Repo

TL;DR

This paper introduces RSLAD, a novel method that uses robust soft labels from large adversarially trained models to significantly improve the robustness of small neural networks against adversarial attacks.

Contribution

The paper proposes RSLAD, a new adversarial robustness distillation technique leveraging robust soft labels to enhance small model robustness, outperforming existing methods.

Findings

01

RSLAD improves small model robustness against state-of-the-art attacks.

02

Robust soft labels are crucial for effective adversarial robustness distillation.

03

RSLAD outperforms existing adversarial training and distillation methods.

Abstract

Adversarial training is one effective approach for training robust deep neural networks against adversarial attacks. While being able to bring reliable robustness, adversarial training (AT) methods in general favor high capacity models, i.e., the larger the model the better the robustness. This tends to limit their effectiveness on small models, which are more preferable in scenarios where storage or computing resources are very limited (e.g., mobile devices). In this paper, we leverage the concept of knowledge distillation to improve the robustness of small models by distilling from adversarially trained large models. We first revisit several state-of-the-art AT methods from a distillation perspective and identify one common technique that can lead to improved robustness: the use of robust soft labels -- predictions of a robust model. Following this observation, we propose a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zibojia/rslad
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications

MethodsKnowledge Distillation