Analyzing Federated Learning through an Adversarial Lens
Arjun Nitin Bhagoji, Supriyo Chakraborty, Prateek Mittal, Seraphin, Calo

TL;DR
This paper investigates the vulnerability of federated learning to model poisoning attacks by a single malicious agent, proposing strategies to execute stealthy attacks and highlighting the need for defenses.
Contribution
It introduces novel attack strategies for model poisoning in federated learning and demonstrates their effectiveness and stealthiness through interpretability techniques.
Findings
Malicious agents can successfully poison models with high confidence.
Stealthy attack strategies can evade detection by interpretability methods.
Federated learning is highly vulnerable to single-agent poisoning attacks.
Abstract
Federated learning distributes model training among a multitude of agents, who, guided by privacy concerns, perform training using their local data but share only model parameter updates, for iterative aggregation at the server. In this work, we explore the threat of model poisoning attacks on federated learning initiated by a single, non-colluding malicious agent where the adversarial objective is to cause the model to misclassify a set of chosen inputs with high confidence. We explore a number of strategies to carry out this attack, starting with simple boosting of the malicious agent's update to overcome the effects of other agents' updates. To increase attack stealth, we propose an alternating minimization strategy, which alternately optimizes for the training loss and the adversarial objective. We follow up by using parameter estimation for the benign agents' updates to improve on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
MethodsInterpretability
