On the robustness of randomized classifiers to adversarial examples

Rafael Pinot; Laurent Meunier; Florian Yger; C\'edric; Gouy-Pailler; Yann Chevaleyre; Jamal Atif

arXiv:2102.10875·cs.LG·February 23, 2021

On the robustness of randomized classifiers to adversarial examples

Rafael Pinot, Laurent Meunier, Florian Yger, C\'edric, Gouy-Pailler, Yann Chevaleyre, Jamal Atif

PDF

Open Access

TL;DR

This paper analyzes the robustness of randomized classifiers against adversarial attacks, introducing new theoretical bounds and a noise injection method, with experiments demonstrating improved robustness and accuracy on image datasets.

Contribution

It introduces a new robustness notion for randomized classifiers, derives bounds on adversarial generalization, and proposes an effective noise injection technique for robustness.

Findings

01

New bounds on adversarial generalization gap.

02

Effective noise injection method for robust classifiers.

03

Deep neural networks achieve high accuracy with guaranteed robustness.

Abstract

This paper investigates the theory of robustness against adversarial attacks. We focus on randomized classifiers (\emph{i.e.} classifiers that output random variables) and provide a thorough analysis of their behavior through the lens of statistical learning theory and information theory. To this aim, we introduce a new notion of robustness for randomized classifiers, enforcing local Lipschitzness using probability metrics. Equipped with this definition, we make two new contributions. The first one consists in devising a new upper bound on the adversarial generalization gap of randomized classifiers. More precisely, we devise bounds on the generalization gap and the adversarial gap (\emph{i.e.} the gap between the risk and the worst-case risk under attack) of randomized classifiers. The second contribution presents a yet simple but efficient noise injection method to design robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications