BaFFLe: Backdoor detection via Feedback-based Federated Learning
Sebastien Andreina, Giorgia Azzurra Marson, Helen M\"ollering, Ghassan, Karame

TL;DR
BAFFLE is a novel federated learning defense that uses client feedback and diverse datasets to effectively detect backdoor poisoning attacks with high accuracy and low false positives.
Contribution
This paper introduces BAFFLE, a feedback-based method leveraging client data diversity to detect backdoor attacks in federated learning, outperforming existing defenses.
Findings
Achieves 100% detection accuracy on CIFAR-10 and FEMNIST datasets.
Maintains false-positive rate below 5%.
Effectively detects adaptive backdoor attacks.
Abstract
Recent studies have shown that federated learning (FL) is vulnerable to poisoning attacks that inject a backdoor into the global model. These attacks are effective even when performed by a single client, and undetectable by most existing defensive techniques. In this paper, we propose Backdoor detection via Feedback-based Federated Learning (BAFFLE), a novel defense to secure FL against backdoor attacks. The core idea behind BAFFLE is to leverage data of multiple clients not only for training but also for uncovering model poisoning. We exploit the availability of diverse datasets at the various clients by incorporating a feedback loop into the FL process, to integrate the views of those clients when deciding whether a given model update is genuine or not. We show that this powerful construct can achieve very high detection rates against state-of-the-art backdoor attacks, even when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
