RAB: Provable Robustness Against Backdoor Attacks

Maurice Weber; Xiaojun Xu; Bojan Karla\v{s}; Ce Zhang; Bo Li

arXiv:2003.08904·cs.LG·August 4, 2023·48 cites

RAB: Provable Robustness Against Backdoor Attacks

Maurice Weber, Xiaojun Xu, Bojan Karla\v{s}, Ce Zhang, Bo Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces RAB, a novel training method that certifies and enhances the robustness of various machine learning models, including deep neural networks and K-NN classifiers, against backdoor and evasion attacks using randomized smoothing techniques.

Contribution

It presents the first robust training process, RAB, with proven tight robustness bounds against backdoor attacks and an exact smooth-training algorithm for simple models.

Findings

01

RAB effectively certifies robustness against backdoor attacks.

02

The exact smooth-training algorithm improves efficiency for simple models.

03

Empirical results demonstrate enhanced robustness across multiple datasets and models.

Abstract

Recent studies have shown that deep neural networks (DNNs) are vulnerable to adversarial attacks, including evasion and backdoor (poisoning) attacks. On the defense side, there have been intensive efforts on improving both empirical and provable robustness against evasion attacks; however, the provable robustness against backdoor attacks still remains largely unexplored. In this paper, we focus on certifying the machine learning model robustness against general threat models, especially backdoor attacks. We first provide a unified framework via randomized smoothing techniques and show how it can be instantiated to certify the robustness against both evasion and backdoor attacks. We then propose the first robust training process, RAB, to smooth the trained model and certify its robustness against backdoor attacks. We prove the robustness bound for machine learning models trained with RAB…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AI-secure/Robustness-Against-Backdoor-Attacks
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Cardiac Arrest and Resuscitation

MethodsRandomized Smoothing · k-Nearest Neighbors