MORA: Improving Ensemble Robustness Evaluation with Model-Reweighing   Attack

Yunrui Yu; Xitong Gao; Cheng-Zhong Xu

arXiv:2211.08008·cs.LG·November 16, 2022·5 cites

MORA: Improving Ensemble Robustness Evaluation with Model-Reweighing Attack

Yunrui Yu, Xitong Gao, Cheng-Zhong Xu

PDF

Open Access 1 Repo

TL;DR

This paper introduces MORA, a new gradient-reweighing attack that more accurately evaluates ensemble neural network defenses, revealing many are less robust than previously estimated.

Contribution

The paper presents MORA, a novel attack method that outperforms existing attacks in evaluating ensemble robustness and exposes overestimations in current defense assessments.

Findings

01

Most ensemble defenses are significantly less robust than prior estimates.

02

MORA achieves higher attack success rates and faster convergence than state-of-the-art attacks.

03

Ensemble defenses show near-zero robustness against MORA with small perturbations.

Abstract

Adversarial attacks can deceive neural networks by adding tiny perturbations to their input data. Ensemble defenses, which are trained to minimize attack transferability among sub-models, offer a promising research direction to improve robustness against such attacks while maintaining a high accuracy on natural inputs. We discover, however, that recent state-of-the-art (SOTA) adversarial attack strategies cannot reliably evaluate ensemble defenses, sizeably overestimating their robustness. This paper identifies the two factors that contribute to this behavior. First, these defenses form ensembles that are notably difficult for existing gradient-based method to attack, due to gradient obfuscation. Second, ensemble defenses diversify sub-model gradients, presenting a challenge to defeat all sub-models simultaneously, simply summing their contributions may counteract the overall attack…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lafeat/mora
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications