Resilience from Diversity: Population-based approach to harden models against adversarial attacks
Jasser Jasser, Ivan Garibay

TL;DR
This paper proposes a population-based, diverse ensemble approach with random submodel selection and counter linking to improve deep learning models' resilience against adversarial attacks, achieving significant robustness gains.
Contribution
Introduces a Counter-Linked Model (CLM) that maintains diversity among submodels and uses randomization to enhance adversarial robustness, combined with adversarial training for state-of-the-art defense.
Findings
Enhanced robustness by around 20% on MNIST
Achieved at least 15% robustness improvement on CIFAR-10
Coupling with adversarial training yields state-of-the-art results
Abstract
Traditional deep learning networks (DNN) exhibit intriguing vulnerabilities that allow an attacker to force them to fail at their task. Notorious attacks such as the Fast Gradient Sign Method (FGSM) and the more powerful Projected Gradient Descent (PGD) generate adversarial samples by adding a magnitude of perturbation to the input's computed gradient, resulting in a deterioration of the effectiveness of the model's classification. This work introduces a model that is resilient to adversarial attacks. Our model leverages an established mechanism of defense which utilizes randomness and a population of DNNs. More precisely, our model consists of a population of diverse submodels, each one of them trained to individually obtain a high accuracy for the task at hand, while forced to maintain meaningful differences in their weights. Each time our model receives a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
