Provable Robustness against Backdoor Attacks via the Primal-Dual Perspective on Differential Privacy

Aman Saxena; Jan Schuchardt; Yan Scholten; Stephan G\"unnemann

arXiv:2605.21780·cs.LG·May 22, 2026

Provable Robustness against Backdoor Attacks via the Primal-Dual Perspective on Differential Privacy

Aman Saxena, Jan Schuchardt, Yan Scholten, Stephan G\"unnemann

PDF

TL;DR

This paper introduces a unified framework connecting randomized smoothing and differential privacy to certify robustness against complex backdoor attacks in machine learning models.

Contribution

It develops a modular, end-to-end certification method for complex mechanisms using the dual view of differential privacy, addressing backdoor attack robustness.

Findings

01

Framework provides tight robustness guarantees for backdoor attacks.

02

Experiments demonstrate effectiveness on MNIST and CIFAR-10.

03

Enables compositional analysis of randomized mechanisms for security.

Abstract

Randomized smoothing is a powerful tool for certifying robustness to adversarial perturbations, including poisoning attacks via randomized training and evasion attacks via randomized inference. Extending these guarantees to backdoor attacks, where training and test data are jointly perturbed, remains challenging because training- and test-time randomized mechanisms must be analyzed within a single robustness certificate. We address this by connecting randomized smoothing to the dual view of differential privacy through privacy profiles, which provide a numerical procedure for composing heterogeneous mechanisms. The resulting framework enables tight, modular, end-to-end certification of complex, composed mechanisms while leveraging existing analyses of differentially private mechanisms. We instantiate the framework for DP-SGD and Deep Partition Aggregation with inference-time smoothing,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.