On the Robustness of Bayesian Neural Networks to Adversarial Attacks
Luca Bortolussi, Ginevra Carbone, Luca Laurenti, Andrea Patane, Guido, Sanguinetti, Matthew Wicker

TL;DR
This paper investigates the robustness of Bayesian Neural Networks against adversarial attacks, revealing that in the large-data limit, BNNs are inherently more resistant due to data degeneracy, with experimental validation on multiple datasets.
Contribution
It provides a theoretical analysis linking data geometry to BNN robustness and demonstrates that BNN posteriors are robust to gradient-based attacks in the large-data limit.
Findings
BNNs are robust to gradient-based attacks in the large-data limit.
Data lying on lower-dimensional manifolds causes vulnerability in neural networks.
Experiments show BNNs maintain accuracy and robustness on multiple datasets.
Abstract
Vulnerability to adversarial attacks is one of the principal hurdles to the adoption of deep learning in safety-critical applications. Despite significant efforts, both practical and theoretical, training deep learning models robust to adversarial attacks is still an open problem. In this paper, we analyse the geometry of adversarial attacks in the large-data, overparameterized limit for Bayesian Neural Networks (BNNs). We show that, in the limit, vulnerability to gradient-based attacks arises as a result of degeneracy in the data distribution, i.e., when the data lies on a lower-dimensional submanifold of the ambient space. As a direct consequence, we demonstrate that in this limit BNN posteriors are robust to gradient-based adversarial attacks. Crucially, we prove that the expected gradient of the loss with respect to the BNN posterior distribution is vanishing, even when each neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsVariational Inference
