Defending against the Label-flipping Attack in Federated Learning
Najeeb Moharram Jebreel, Josep Domingo-Ferrer, David S\'anchez and, Alberto Blanco-Justicia

TL;DR
This paper introduces a novel gradient-based detection method for label-flipping attacks in federated learning, effectively identifying malicious updates and improving model robustness across various data distributions and model sizes.
Contribution
The paper presents a new defense mechanism that analyzes gradients related to source and target classes, outperforming existing methods in detecting label-flipping attacks.
Findings
Effective detection of label-flipping attacks across datasets.
Outperforms state-of-the-art defenses in accuracy and stability.
Robust against high-dimensional models and diverse data distributions.
Abstract
Federated learning (FL) provides autonomy and privacy by design to participating peers, who cooperatively build a machine learning (ML) model while keeping their private data in their devices. However, that same autonomy opens the door for malicious peers to poison the model by conducting either untargeted or targeted poisoning attacks. The label-flipping (LF) attack is a targeted poisoning attack where the attackers poison their training data by flipping the labels of some examples from one class (i.e., the source class) to another (i.e., the target class). Unfortunately, this attack is easy to perform and hard to detect and it negatively impacts on the performance of the global model. Existing defenses against LF are limited by assumptions on the distribution of the peers' data and/or do not perform well with high-dimensional models. In this paper, we deeply investigate the LF attack…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data
MethodsTest
