Data Poisoning Attacks Against Federated Learning Systems

Vale Tolpegin; Stacey Truex; Mehmet Emre Gursoy; and Ling Liu

arXiv:2007.08432·cs.LG·August 13, 2020·51 cites

Data Poisoning Attacks Against Federated Learning Systems

Vale Tolpegin, Stacey Truex, Mehmet Emre Gursoy, and Ling Liu

PDF

Open Access 2 Repos

TL;DR

This paper investigates targeted data poisoning attacks in federated learning, demonstrating their effectiveness in reducing model accuracy and proposing a defense to identify malicious participants.

Contribution

It introduces a novel analysis of targeted poisoning attacks in federated learning and proposes an effective defense strategy to mitigate such threats.

Findings

01

Poisoning attacks cause significant drops in accuracy.

02

Attacks can be targeted to specific classes.

03

Defense strategy effectively detects malicious participants.

Abstract

Federated learning (FL) is an emerging paradigm for distributed training of large-scale deep neural networks in which participants' data remains on their own devices with only model updates being shared with a central server. However, the distributed nature of FL gives rise to new threats caused by potentially malicious participants. In this paper, we study targeted data poisoning attacks against FL systems in which a malicious subset of the participants aim to poison the global model by sending model updates derived from mislabeled data. We first demonstrate that such data poisoning attacks can cause substantial drops in classification accuracy and recall, even with a small percentage of malicious participants. We additionally show that the attacks can be targeted, i.e., they have a large negative impact only on classes that are under attack. We also study attack longevity in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques