Securing Federated Learning against Backdoor Threats with Foundation Model Integration
Xiaohuan Bi, Xi Li

TL;DR
This paper introduces a new defense method for federated learning that effectively detects and mitigates backdoor attacks, including novel attacks exploiting foundation models, by constraining internal model activations during training.
Contribution
It proposes a data-free defense strategy based on constraining internal activations to defend against both classic and novel backdoor attacks in federated learning.
Findings
Effective against novel backdoor attacks exploiting foundation models
Outperforms existing defenses in experimental evaluations
Preserves model functionality while mitigating backdoors
Abstract
Federated Learning (FL) enables decentralized model training while preserving privacy. Recently, the integration of Foundation Models (FMs) into FL has enhanced performance but introduced a novel backdoor attack mechanism. Attackers can exploit FM vulnerabilities to embed backdoors into synthetic data generated by FMs. During global model fusion, these backdoors are transferred to the global model through compromised synthetic data, subsequently infecting all client models. Existing FL backdoor defenses are ineffective against this novel attack due to its fundamentally different mechanism compared to classic ones. In this work, we propose a novel data-free defense strategy that addresses both classic and novel backdoor attacks in FL. The shared attack pattern lies in the abnormal activations within the hidden feature space during model aggregation. Hence, we propose to constrain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Data Security Solutions · Security and Verification in Computing · Adversarial Robustness in Machine Learning
