Integrating uncertainty quantification into randomized smoothing based robustness guarantees
Sina D\"aubener, Kira Maag, David Krueger, Asja Fischer

TL;DR
This paper combines randomized smoothing with uncertainty-based rejection to provide enhanced robustness guarantees for neural networks, allowing for better detection of adversarial attacks and out-of-distribution inputs.
Contribution
It introduces a novel framework integrating uncertainty quantification into certified robustness, deriving new guarantees and improving robustness metrics.
Findings
Up to 20.93% larger robustness radius on CIFAR10 with uncertainty rejection.
Enhanced robustness guarantees for uncertainty-aware classifiers.
Improved out-of-distribution detection using uncertainty measures.
Abstract
Deep neural networks have proven to be extremely powerful, however, they are also vulnerable to adversarial attacks which can cause hazardous incorrect predictions in safety-critical applications. Certified robustness via randomized smoothing gives a probabilistic guarantee that the smoothed classifier's predictions will not change within an -ball around a given input. On the other hand (uncertainty) score-based rejection is a technique often applied in practice to defend models against adversarial attacks. In this work, we fuse these two approaches by integrating a classifier that abstains from predicting when uncertainty is high into the certified robustness framework. This allows us to derive two novel robustness guarantees for uncertainty aware classifiers, namely (i) the radius of an -ball around the input in which the same label is predicted and uncertainty remains…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Probabilistic and Robust Engineering Design · Adversarial Robustness in Machine Learning
MethodsAttentive Walk-Aggregating Graph Neural Network · Randomized Smoothing
