On Evaluating Adversarial Robustness
Nicholas Carlini, Anish Athalye, Nicolas Papernot, Wieland Brendel,, Jonas Rauber, Dimitris Tsipras, Ian Goodfellow, Aleksander Madry, Alexey, Kurakin

TL;DR
This paper discusses the challenges in properly evaluating adversarial defenses, reviews current best practices, and proposes new methods to improve the reliability of security assessments against adversarial attacks.
Contribution
It provides a comprehensive review of evaluation methodologies and introduces new approaches to enhance the robustness assessment of adversarial defenses.
Findings
Current evaluation methods often lead to incorrect conclusions
Best practices are essential for reliable security assessments
Proposed new evaluation methods aim to improve robustness validation
Abstract
Correctly evaluating defenses against adversarial examples has proven to be extremely difficult. Despite the significant amount of recent work attempting to design defenses that withstand adaptive attacks, few have succeeded; most papers that propose defenses are quickly shown to be incorrect. We believe a large contributing factor is the difficulty of performing security evaluations. In this paper, we discuss the methodological foundations, review commonly accepted best practices, and suggest new methods for evaluating defenses to adversarial examples. We hope that both researchers developing defenses as well as readers and reviewers who wish to understand the completeness of an evaluation consider our advice in order to avoid common pitfalls.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Advanced Malware Detection Techniques
