RAB: Provable Robustness Against Backdoor Attacks
Maurice Weber, Xiaojun Xu, Bojan Karla\v{s}, Ce Zhang, Bo Li

TL;DR
This paper introduces RAB, a novel training method that certifies and enhances the robustness of various machine learning models, including deep neural networks and K-NN classifiers, against backdoor and evasion attacks using randomized smoothing techniques.
Contribution
It presents the first robust training process, RAB, with proven tight robustness bounds against backdoor attacks and an exact smooth-training algorithm for simple models.
Findings
RAB effectively certifies robustness against backdoor attacks.
The exact smooth-training algorithm improves efficiency for simple models.
Empirical results demonstrate enhanced robustness across multiple datasets and models.
Abstract
Recent studies have shown that deep neural networks (DNNs) are vulnerable to adversarial attacks, including evasion and backdoor (poisoning) attacks. On the defense side, there have been intensive efforts on improving both empirical and provable robustness against evasion attacks; however, the provable robustness against backdoor attacks still remains largely unexplored. In this paper, we focus on certifying the machine learning model robustness against general threat models, especially backdoor attacks. We first provide a unified framework via randomized smoothing techniques and show how it can be instantiated to certify the robustness against both evasion and backdoor attacks. We then propose the first robust training process, RAB, to smooth the trained model and certify its robustness against backdoor attacks. We prove the robustness bound for machine learning models trained with RAB…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Cardiac Arrest and Resuscitation
MethodsRandomized Smoothing · k-Nearest Neighbors
