Feature Distillation With Guided Adversarial Contrastive Learning

Tao Bai; Jinnan Chen; Jun Zhao; Bihan Wen; Xudong Jiang; Alex Kot

arXiv:2009.09922·cs.LG·September 22, 2020·1 cites

Feature Distillation With Guided Adversarial Contrastive Learning

Tao Bai, Jinnan Chen, Jun Zhao, Bihan Wen, Xudong Jiang, Alex Kot

PDF

Open Access

TL;DR

This paper introduces Guided Adversarial Contrastive Distillation (GACD), a novel method that transfers adversarial robustness from teacher to student models using feature-based contrastive learning and sample reweighting, achieving effective robustness transfer across models and tasks.

Contribution

The paper proposes GACD, a new feature distillation approach that enhances adversarial robustness transfer through contrastive learning and reweighted sample estimation, outperforming existing methods.

Findings

01

GACD effectively transfers robustness across models and tasks.

02

Students learn more robust features and structural knowledge from teachers.

03

Experimental results show GACD achieves comparable or better robustness than existing methods.

Abstract

Deep learning models are shown to be vulnerable to adversarial examples. Though adversarial training can enhance model robustness, typical approaches are computationally expensive. Recent works proposed to transfer the robustness to adversarial attacks across different tasks or models with soft labels.Compared to soft labels, feature contains rich semantic information and holds the potential to be applied to different downstream tasks. In this paper, we propose a novel approach called Guided Adversarial Contrastive Distillation (GACD), to effectively transfer adversarial robustness from teacher to student with features. We first formulate this objective as contrastive learning and connect it with mutual information. With a well-trained teacher model as an anchor, students are expected to extract features similar to the teacher. Then considering the potential errors made by teachers, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning

MethodsContrastive Learning