ProFeAT: Projected Feature Adversarial Training for Self-Supervised   Learning of Robust Representations

Sravanti Addepalli; Priyam Dey; R. Venkatesh Babu

arXiv:2406.05796·cs.LG·June 11, 2024·1 cites

ProFeAT: Projected Feature Adversarial Training for Self-Supervised Learning of Robust Representations

Sravanti Addepalli, Priyam Dey, R. Venkatesh Babu

PDF

Open Access

TL;DR

ProFeAT introduces a novel training framework that enhances self-supervised adversarial learning by aligning teacher-student objectives with a projection head, significantly improving robustness and accuracy over previous SSL-AT methods.

Contribution

The paper proposes Projected Feature Adversarial Training (ProFeAT), a new method that reduces the performance gap in SSL-AT by using a projection head and tailored losses, achieving state-of-the-art results.

Findings

01

Significant improvements in clean and robust accuracy.

02

ProFeAT outperforms existing SSL-AT methods.

03

Comparable or better performance than supervised-AT methods.

Abstract

The need for abundant labelled data in supervised Adversarial Training (AT) has prompted the use of Self-Supervised Learning (SSL) techniques with AT. However, the direct application of existing SSL methods to adversarial training has been sub-optimal due to the increased training complexity of combining SSL with AT. A recent approach, DeACL, mitigates this by utilizing supervision from a standard SSL teacher in a distillation setting, to mimic supervised AT. However, we find that there is still a large performance gap when compared to supervised adversarial training, specifically on larger models. In this work, investigate the key reason for this gap and propose Projected Feature Adversarial Training (ProFeAT) to bridge the same. We show that the sub-optimal distillation performance is a result of mismatch in training objectives of the teacher and student, and propose to use a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis