BayBFed: Bayesian Backdoor Defense for Federated Learning
Kavita Kumari, Phillip Rieger, Hossein Fereidooni, Murtuza Jadliwala,, Ahmad-Reza Sadeghi

TL;DR
BayBFed introduces a Bayesian non-parametric framework for federated learning that effectively detects and filters malicious backdoor updates, improving security without harming model performance.
Contribution
It proposes a novel probabilistic approach using Bayesian non-parametric methods to detect malicious updates in federated learning, overcoming limitations of previous methods.
Findings
Effectively detects malicious updates across multiple datasets
Maintains model performance while filtering backdoor attacks
Outperforms existing defense strategies in robustness
Abstract
Federated learning (FL) allows participants to jointly train a machine learning model without sharing their private data with others. However, FL is vulnerable to poisoning attacks such as backdoor attacks. Consequently, a variety of defenses have recently been proposed, which have primarily utilized intermediary states of the global model (i.e., logits) or distance of the local models (i.e., L2-norm) from the global model to detect malicious backdoors. However, as these approaches directly operate on client updates, their effectiveness depends on factors such as clients' data distribution or the adversary's attack strategies. In this paper, we introduce a novel and more generic backdoor defense framework, called BayBFed, which proposes to utilize probability distributions over client updates to detect malicious updates in FL: it computes a probabilistic measure over the clients'…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Network Security and Intrusion Detection · Adversarial Robustness in Machine Learning
