ProFeAT: Projected Feature Adversarial Training for Self-Supervised Learning of Robust Representations
Sravanti Addepalli, Priyam Dey, R. Venkatesh Babu

TL;DR
ProFeAT introduces a novel training framework that enhances self-supervised adversarial learning by aligning teacher-student objectives with a projection head, significantly improving robustness and accuracy over previous SSL-AT methods.
Contribution
The paper proposes Projected Feature Adversarial Training (ProFeAT), a new method that reduces the performance gap in SSL-AT by using a projection head and tailored losses, achieving state-of-the-art results.
Findings
Significant improvements in clean and robust accuracy.
ProFeAT outperforms existing SSL-AT methods.
Comparable or better performance than supervised-AT methods.
Abstract
The need for abundant labelled data in supervised Adversarial Training (AT) has prompted the use of Self-Supervised Learning (SSL) techniques with AT. However, the direct application of existing SSL methods to adversarial training has been sub-optimal due to the increased training complexity of combining SSL with AT. A recent approach, DeACL, mitigates this by utilizing supervision from a standard SSL teacher in a distillation setting, to mimic supervised AT. However, we find that there is still a large performance gap when compared to supervised adversarial training, specifically on larger models. In this work, investigate the key reason for this gap and propose Projected Feature Adversarial Training (ProFeAT) to bridge the same. We show that the sub-optimal distillation performance is a result of mismatch in training objectives of the teacher and student, and propose to use a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
