Rethinking Clustering for Robustness
Motasem Alfarra, Juan C. P\'erez, Adel Bibi, Ali Thabet, Pablo, Arbel\'aez, Bernard Ghanem

TL;DR
This paper introduces ClusTR, a clustering-based training framework that enhances neural network robustness without adversarial training, outperforming traditional methods under strong attacks by leveraging semantic features.
Contribution
It provides a robustness certificate for clustering models and proposes ClusTR, a novel adversary-free training method that improves robustness by aligning features semantically.
Findings
ClusTR outperforms adversarial training by up to 4% under PGD attacks.
Provides a tight robustness certificate for clustering-based classifiers.
Establishes a connection from semantic features to robustness in neural networks.
Abstract
This paper studies how encouraging semantically-aligned features during deep neural network training can increase network robustness. Recent works observed that Adversarial Training leads to robust models, whose learnt features appear to correlate with human perception. Inspired by this connection from robustness to semantics, we study the complementary connection: from semantics to robustness. To do so, we provide a robustness certificate for distance-based classification models (clustering-based classifiers). Moreover, we show that this certificate is tight, and we leverage it to propose ClusTR (Clustering Training for Robustness), a clustering-based and adversary-free training framework to learn robust models. Interestingly, \textit{ClusTR} outperforms adversarially-trained networks by up to under strong PGD attacks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
