On the robustness of randomized classifiers to adversarial examples
Rafael Pinot, Laurent Meunier, Florian Yger, C\'edric, Gouy-Pailler, Yann Chevaleyre, Jamal Atif

TL;DR
This paper analyzes the robustness of randomized classifiers against adversarial attacks, introducing new theoretical bounds and a noise injection method, with experiments demonstrating improved robustness and accuracy on image datasets.
Contribution
It introduces a new robustness notion for randomized classifiers, derives bounds on adversarial generalization, and proposes an effective noise injection technique for robustness.
Findings
New bounds on adversarial generalization gap.
Effective noise injection method for robust classifiers.
Deep neural networks achieve high accuracy with guaranteed robustness.
Abstract
This paper investigates the theory of robustness against adversarial attacks. We focus on randomized classifiers (\emph{i.e.} classifiers that output random variables) and provide a thorough analysis of their behavior through the lens of statistical learning theory and information theory. To this aim, we introduce a new notion of robustness for randomized classifiers, enforcing local Lipschitzness using probability metrics. Equipped with this definition, we make two new contributions. The first one consists in devising a new upper bound on the adversarial generalization gap of randomized classifiers. More precisely, we devise bounds on the generalization gap and the adversarial gap (\emph{i.e.} the gap between the risk and the worst-case risk under attack) of randomized classifiers. The second contribution presents a yet simple but efficient noise injection method to design robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
