Toward Malicious Clients Detection in Federated Learning
Zhihao Dou, Jiaqi Wang, Wei Sun, Zhuqing Liu, and Minghong Fang

TL;DR
This paper introduces SafeFL, a novel algorithm for accurately detecting malicious clients in federated learning by analyzing model behavior through synthetic datasets, significantly improving detection accuracy over existing methods.
Contribution
SafeFL is a new detection algorithm that leverages synthetic datasets generated from global models to identify malicious clients more accurately in federated learning.
Findings
SafeFL outperforms existing detection methods in accuracy.
SafeFL demonstrates high efficiency in identifying malicious clients.
The approach reduces false positives in malicious client detection.
Abstract
Federated learning (FL) enables multiple clients to collaboratively train a global machine learning model without sharing their raw data. However, the decentralized nature of FL introduces vulnerabilities, particularly to poisoning attacks, where malicious clients manipulate their local models to disrupt the training process. While Byzantine-robust aggregation rules have been developed to mitigate such attacks, they remain inadequate against more advanced threats. In response, recent advancements have focused on FL detection techniques to identify potentially malicious participants. Unfortunately, these methods often misclassify numerous benign clients as threats or rely on unrealistic assumptions about the server's capabilities. In this paper, we propose a novel algorithm, SafeFL, specifically designed to accurately identify malicious clients in FL. The SafeFL approach involves the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
