Anomaly Localization in Model Gradients Under Backdoor Attacks Against Federated Learning
Zeki Bilgin

TL;DR
This paper investigates gradient anomalies caused by backdoor attacks in federated learning, revealing that malicious updates mainly affect the final layer bias weights, supported by theoretical and experimental analysis.
Contribution
It provides a deep analysis of gradient variations under backdoor attacks, identifying the final layer bias weights as key indicators of malicious activity.
Findings
Backdoor anomalies mainly appear in final layer bias weights.
The number of malicious clients influences the severity of anomalies.
Learning rate and malicious data rate affect gradient anomaly visibility.
Abstract
Inserting a backdoor into the joint model in federated learning (FL) is a recent threat raising concerns. Existing studies mostly focus on developing effective countermeasures against this threat, assuming that backdoored local models, if any, somehow reveal themselves by anomalies in their gradients. However, this assumption needs to be elaborated by identifying specifically which gradients are more likely to indicate an anomaly to what extent under which conditions. This is an important issue given that neural network models usually have huge parametric space and consist of a large number of weights. In this study, we make a deep gradient-level analysis on the expected variations in model gradients under several backdoor attack scenarios against FL. Our main novel finding is that backdoor-induced anomalies in local model updates (weights or gradients) appear in the final layer bias…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques
