Data Poisoning Attacks Against Federated Learning Systems
Vale Tolpegin, Stacey Truex, Mehmet Emre Gursoy, and Ling Liu

TL;DR
This paper investigates targeted data poisoning attacks in federated learning, demonstrating their effectiveness in reducing model accuracy and proposing a defense to identify malicious participants.
Contribution
It introduces a novel analysis of targeted poisoning attacks in federated learning and proposes an effective defense strategy to mitigate such threats.
Findings
Poisoning attacks cause significant drops in accuracy.
Attacks can be targeted to specific classes.
Defense strategy effectively detects malicious participants.
Abstract
Federated learning (FL) is an emerging paradigm for distributed training of large-scale deep neural networks in which participants' data remains on their own devices with only model updates being shared with a central server. However, the distributed nature of FL gives rise to new threats caused by potentially malicious participants. In this paper, we study targeted data poisoning attacks against FL systems in which a malicious subset of the participants aim to poison the global model by sending model updates derived from mislabeled data. We first demonstrate that such data poisoning attacks can cause substantial drops in classification accuracy and recall, even with a small percentage of malicious participants. We additionally show that the attacks can be targeted, i.e., they have a large negative impact only on classes that are under attack. We also study attack longevity in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques
