MORA: Improving Ensemble Robustness Evaluation with Model-Reweighing Attack
Yunrui Yu, Xitong Gao, Cheng-Zhong Xu

TL;DR
This paper introduces MORA, a new gradient-reweighing attack that more accurately evaluates ensemble neural network defenses, revealing many are less robust than previously estimated.
Contribution
The paper presents MORA, a novel attack method that outperforms existing attacks in evaluating ensemble robustness and exposes overestimations in current defense assessments.
Findings
Most ensemble defenses are significantly less robust than prior estimates.
MORA achieves higher attack success rates and faster convergence than state-of-the-art attacks.
Ensemble defenses show near-zero robustness against MORA with small perturbations.
Abstract
Adversarial attacks can deceive neural networks by adding tiny perturbations to their input data. Ensemble defenses, which are trained to minimize attack transferability among sub-models, offer a promising research direction to improve robustness against such attacks while maintaining a high accuracy on natural inputs. We discover, however, that recent state-of-the-art (SOTA) adversarial attack strategies cannot reliably evaluate ensemble defenses, sizeably overestimating their robustness. This paper identifies the two factors that contribute to this behavior. First, these defenses form ensembles that are notably difficult for existing gradient-based method to attack, due to gradient obfuscation. Second, ensemble defenses diversify sub-model gradients, presenting a challenge to defeat all sub-models simultaneously, simply summing their contributions may counteract the overall attack…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications
