Theoretically Principled Trade-off between Robustness and Accuracy

Hongyang Zhang; Yaodong Yu; Jiantao Jiao; Eric P. Xing and; Laurent El Ghaoui; Michael I. Jordan

arXiv:1901.08573·cs.LG·June 25, 2019·921 cites

Theoretically Principled Trade-off between Robustness and Accuracy

Hongyang Zhang, Yaodong Yu, Jiantao Jiao, Eric P. Xing and, Laurent El Ghaoui, Michael I. Jordan

PDF

Open Access 5 Repos

TL;DR

This paper explores the fundamental trade-off between robustness and accuracy in adversarial defenses, providing a theoretical framework and introducing a new method, TRADES, that effectively balances these aspects.

Contribution

It offers a theoretical decomposition of adversarial error and proposes a new defense method, TRADES, inspired by this theory, achieving state-of-the-art results.

Findings

01

Theoretical upper bound on robust error is tight and distribution-independent.

02

TRADES outperforms previous methods in robustness-accuracy trade-offs.

03

Won first place in NeurIPS 2018 Adversarial Vision Challenge.

Abstract

We identify a trade-off between robustness and accuracy that serves as a guiding principle in the design of defenses against adversarial examples. Although this problem has been widely studied empirically, much remains unknown concerning the theory underlying this trade-off. In this work, we decompose the prediction error for adversarial examples (robust error) as the sum of the natural (classification) error and boundary error, and provide a differentiable upper bound using the theory of classification-calibrated loss, which is shown to be the tightest possible upper bound uniform over all probability distributions and measurable predictors. Inspired by our theoretical analysis, we also design a new defense method, TRADES, to trade adversarial robustness off against accuracy. Our proposed algorithm performs well experimentally in real-world datasets. The methodology is the foundation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Physical Unclonable Functions (PUFs) and Hardware Security