On the Robustness of Bayesian Neural Networks to Adversarial Attacks

Luca Bortolussi; Ginevra Carbone; Luca Laurenti; Andrea Patane; Guido; Sanguinetti; Matthew Wicker

arXiv:2207.06154·cs.LG·February 29, 2024·1 cites

On the Robustness of Bayesian Neural Networks to Adversarial Attacks

Luca Bortolussi, Ginevra Carbone, Luca Laurenti, Andrea Patane, Guido, Sanguinetti, Matthew Wicker

PDF

Open Access 2 Repos

TL;DR

This paper investigates the robustness of Bayesian Neural Networks against adversarial attacks, revealing that in the large-data limit, BNNs are inherently more resistant due to data degeneracy, with experimental validation on multiple datasets.

Contribution

It provides a theoretical analysis linking data geometry to BNN robustness and demonstrates that BNN posteriors are robust to gradient-based attacks in the large-data limit.

Findings

01

BNNs are robust to gradient-based attacks in the large-data limit.

02

Data lying on lower-dimensional manifolds causes vulnerability in neural networks.

03

Experiments show BNNs maintain accuracy and robustness on multiple datasets.

Abstract

Vulnerability to adversarial attacks is one of the principal hurdles to the adoption of deep learning in safety-critical applications. Despite significant efforts, both practical and theoretical, training deep learning models robust to adversarial attacks is still an open problem. In this paper, we analyse the geometry of adversarial attacks in the large-data, overparameterized limit for Bayesian Neural Networks (BNNs). We show that, in the limit, vulnerability to gradient-based attacks arises as a result of degeneracy in the data distribution, i.e., when the data lies on a lower-dimensional submanifold of the ambient space. As a direct consequence, we demonstrate that in this limit BNN posteriors are robust to gradient-based adversarial attacks. Crucially, we prove that the expected gradient of the loss with respect to the BNN posterior distribution is vanishing, even when each neural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsVariational Inference