How to beat a Bayesian adversary
Zihan Ding, Kexin Jin, Jonas Latz, Chenguang Liu

TL;DR
This paper introduces Abram, a novel continuous-time particle system approach to Bayesian adversarial robustness in deep learning, offering a relaxation of traditional minmax adversarial training and demonstrating effectiveness in benchmarks.
Contribution
The paper proposes Abram, a new particle system method that approximates Bayesian adversarial robustness, providing a relaxation of minmax optimization and practical discretisation strategies.
Findings
Abram effectively approximates the Bayesian adversarial robustness problem.
Discretised Abram performs well in benchmark adversarial deep learning tasks.
Theoretical justification links Abram to minimisers of the robustness problem.
Abstract
Deep neural networks and other modern machine learning models are often susceptible to adversarial attacks. Indeed, an adversary may often be able to change a model's prediction through a small, directed perturbation of the model's input - an issue in safety-critical applications. Adversarially robust machine learning is usually based on a minmax optimisation problem that minimises the machine learning loss under maximisation-based adversarial attacks. In this work, we study adversaries that determine their attack using a Bayesian statistical approach rather than maximisation. The resulting Bayesian adversarial robustness problem is a relaxation of the usual minmax problem. To solve this problem, we propose Abram - a continuous-time particle system that shall approximate the gradient flow corresponding to the underlying learning problem. We show that Abram approximates a McKean-Vlasov…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis
