How Does Bayes Error Limit Probabilistic Robust Accuracy
Ruihan Zhang, Jun Sun

TL;DR
This paper investigates the theoretical limits of probabilistic robustness in neural networks under adversarial attacks, revealing that it can achieve higher accuracy bounds than deterministic robustness and proposing methods to improve probabilistic robustness.
Contribution
It provides a theoretical analysis of the upper bounds of probabilistic robustness based on Bayes error, and demonstrates how probabilistic robustness can be optimized for better accuracy.
Findings
Probabilistic robustness has a higher upper bound than deterministic robustness.
Voting within the vicinity improves probabilistic robust accuracy.
Upper bound increases as the allowed probability of failure, κ, grows.
Abstract
Adversarial examples pose a security threat to many critical systems built on neural networks. Given that deterministic robustness often comes with significantly reduced accuracy, probabilistic robustness (i.e., the probability of having the same label with a vicinity is ) has been proposed as a promising way of achieving robustness whilst maintaining accuracy. However, existing training methods for probabilistic robustness still experience non-trivial accuracy loss. It is unclear whether there is an upper bound on the accuracy when optimising towards probabilistic robustness, and whether there is a certain relationship between and this bound. This work studies these problems from a Bayes error perspective. We find that while Bayes uncertainty does affect probabilistic robustness, its impact is smaller than that on deterministic robustness. This reduced Bayes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Advanced Statistical Methods and Models
MethodsALIGN
