Attack of the Tails: Yes, You Really Can Backdoor Federated Learning
Hongyi Wang, Kartik Sreenivasan, Shashank Rajput, Harit Vishwakarma,, Saurabh Agarwal, Jy-yong Sohn, Kangwook Lee, Dimitris Papailiopoulos

TL;DR
This paper demonstrates that federated learning models are vulnerable to backdoor attacks, especially edge-case backdoors that exploit tail inputs, and shows that defending against such attacks is theoretically and practically challenging.
Contribution
The work introduces a new class of edge-case backdoor attacks in federated learning and provides theoretical evidence that detecting or defending against them is computationally hard.
Findings
Edge-case backdoors can cause misclassification on unlikely inputs.
Robustness to backdoors implies vulnerability to adversarial examples.
Detecting backdoors is computationally infeasible under certain assumptions.
Abstract
Due to its decentralized nature, Federated Learning (FL) lends itself to adversarial attacks in the form of backdoors during training. The goal of a backdoor is to corrupt the performance of the trained model on specific sub-tasks (e.g., by classifying green cars as frogs). A range of FL backdoor attacks have been introduced in the literature, but also methods to defend against them, and it is currently an open question whether FL systems can be tailored to be robust against backdoors. In this work, we provide evidence to the contrary. We first establish that, in the general case, robustness to backdoors implies model robustness to adversarial examples, a major open problem in itself. Furthermore, detecting the presence of a backdoor in a FL model is unlikely assuming first order oracles or polynomial time. We couple our theoretical results with a new family of backdoor attacks, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Machine Learning and Algorithms
