Feature Distillation With Guided Adversarial Contrastive Learning
Tao Bai, Jinnan Chen, Jun Zhao, Bihan Wen, Xudong Jiang, Alex Kot

TL;DR
This paper introduces Guided Adversarial Contrastive Distillation (GACD), a novel method that transfers adversarial robustness from teacher to student models using feature-based contrastive learning and sample reweighting, achieving effective robustness transfer across models and tasks.
Contribution
The paper proposes GACD, a new feature distillation approach that enhances adversarial robustness transfer through contrastive learning and reweighted sample estimation, outperforming existing methods.
Findings
GACD effectively transfers robustness across models and tasks.
Students learn more robust features and structural knowledge from teachers.
Experimental results show GACD achieves comparable or better robustness than existing methods.
Abstract
Deep learning models are shown to be vulnerable to adversarial examples. Though adversarial training can enhance model robustness, typical approaches are computationally expensive. Recent works proposed to transfer the robustness to adversarial attacks across different tasks or models with soft labels.Compared to soft labels, feature contains rich semantic information and holds the potential to be applied to different downstream tasks. In this paper, we propose a novel approach called Guided Adversarial Contrastive Distillation (GACD), to effectively transfer adversarial robustness from teacher to student with features. We first formulate this objective as contrastive learning and connect it with mutual information. With a well-trained teacher model as an anchor, students are expected to extract features similar to the teacher. Then considering the potential errors made by teachers, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
MethodsContrastive Learning
