Provable Robustness against Backdoor Attacks via the Primal-Dual Perspective on Differential Privacy
Aman Saxena, Jan Schuchardt, Yan Scholten, Stephan G\"unnemann

TL;DR
This paper introduces a unified framework connecting randomized smoothing and differential privacy to certify robustness against complex backdoor attacks in machine learning models.
Contribution
It develops a modular, end-to-end certification method for complex mechanisms using the dual view of differential privacy, addressing backdoor attack robustness.
Findings
Framework provides tight robustness guarantees for backdoor attacks.
Experiments demonstrate effectiveness on MNIST and CIFAR-10.
Enables compositional analysis of randomized mechanisms for security.
Abstract
Randomized smoothing is a powerful tool for certifying robustness to adversarial perturbations, including poisoning attacks via randomized training and evasion attacks via randomized inference. Extending these guarantees to backdoor attacks, where training and test data are jointly perturbed, remains challenging because training- and test-time randomized mechanisms must be analyzed within a single robustness certificate. We address this by connecting randomized smoothing to the dual view of differential privacy through privacy profiles, which provide a numerical procedure for composing heterogeneous mechanisms. The resulting framework enables tight, modular, end-to-end certification of complex, composed mechanisms while leveraging existing analyses of differentially private mechanisms. We instantiate the framework for DP-SGD and Deep Partition Aggregation with inference-time smoothing,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
