$n$-ML: Mitigating Adversarial Examples via Ensembles of Topologically Manipulated Classifiers
Mahmood Sharif, Lujo Bauer, Michael K. Reiter

TL;DR
This paper introduces $n$-ML, an ensemble-based defense mechanism that trains classifiers to specifically resist adversarial examples, improving robustness while maintaining high accuracy and efficiency.
Contribution
The paper presents a novel ensemble training approach that enhances adversarial robustness by training classifiers to classify adversarial inputs differently.
Findings
Retains high accuracy on benign data
Provides stronger adversarial defense than existing methods
Achieves lower classification-time overhead
Abstract
This paper proposes a new defense called -ML against adversarial examples, i.e., inputs crafted by perturbing benign inputs by small amounts to induce misclassifications by classifiers. Inspired by -version programming, -ML trains an ensemble of classifiers, and inputs are classified by a vote of the classifiers in the ensemble. Unlike prior such approaches, however, the classifiers in the ensemble are trained specifically to classify adversarial examples differently, rendering it very difficult for an adversarial example to obtain enough votes to be misclassified. We show that -ML roughly retains the benign classification accuracies of state-of-the-art models on the MNIST, CIFAR10, and GTSRB datasets, while simultaneously defending against adversarial examples with better resilience than the best defenses known to date and, in most cases, with lower classification-time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques
