Poisoning Semi-supervised Federated Learning via Unlabeled Data: Attacks and Defenses
Yi Liu, Xingliang Yuan, Ruihui Zhao, Cong Wang, Dusit Niyato, and, Yefeng Zheng

TL;DR
This paper uncovers a vulnerability in semi-supervised federated learning where malicious unlabeled data can poison the model, and proposes a defense strategy based on client selection and quality aggregation to mitigate such attacks.
Contribution
It introduces a novel poisoning attack exploiting semi-supervised learning properties and proposes a minimax optimization-based client selection defense to enhance robustness.
Findings
Poisoning attack effective with only 0.1% malicious data
Defense strategy significantly reduces attack impact
Attacks and defenses validated on multiple datasets
Abstract
Semi-supervised Federated Learning (SSFL) has recently drawn much attention due to its practical consideration, i.e., the clients may only have unlabeled data. In practice, these SSFL systems implement semi-supervised training by assigning a "guessed" label to the unlabeled data near the labeled data to convert the unsupervised problem into a fully supervised problem. However, the inherent properties of such semi-supervised training techniques create a new attack surface. In this paper, we discover and reveal a simple yet powerful poisoning attack against SSFL. Our attack utilizes the natural characteristic of semi-supervised learning to cause the model to be poisoned by poisoning unlabeled data. Specifically, the adversary just needs to insert a small number of maliciously crafted unlabeled samples (e.g., only 0.1\% of the dataset) to infect model performance and misclassification.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting · HIV, Drug Use, Sexual Risk
