REGroup: Rank-aggregating Ensemble of Generative Classifiers for Robust Predictions
Lokender Tiwari, Anish Madan, Saket Anand, Subhashis Banerjee

TL;DR
This paper introduces REGroup, an ensemble of generative classifiers based on intermediate neural responses, which enhances robustness against adversarial attacks without retraining or fine-tuning, achieving state-of-the-art results on ImageNet.
Contribution
The paper proposes a novel ensemble method using intermediate-layer responses to improve adversarial robustness without retraining or architecture modifications.
Findings
Achieves state-of-the-art adversarial defense on ImageNet
Ensemble of generative classifiers improves robustness
Method is architecture-agnostic and attack-agnostic
Abstract
Deep Neural Networks (DNNs) are often criticized for being susceptible to adversarial attacks. Most successful defense strategies adopt adversarial training or random input transformations that typically require retraining or fine-tuning the model to achieve reasonable performance. In this work, our investigations of intermediate representations of a pre-trained DNN lead to an interesting discovery pointing to intrinsic robustness to adversarial attacks. We find that we can learn a generative classifier by statistically characterizing the neural response of an intermediate layer to clean training samples. The predictions of multiple such intermediate-layer based classifiers, when aggregated, show unexpected robustness to adversarial attacks. Specifically, we devise an ensemble of these generative classifiers that rank-aggregates their predictions via a Borda count-based consensus. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
REGroup: Rank-aggregating Ensemble of Generative Classifiers for Robust Predictions· youtube
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Anomaly Detection Techniques and Applications
