Neurotoxin: Durable Backdoors in Federated Learning

Zhengming Zhang; Ashwinee Panda; Linyue Song; Yaoqing Yang; Michael W.; Mahoney; Joseph E. Gonzalez; Kannan Ramchandran; Prateek Mittal

arXiv:2206.10341·cs.CR·June 22, 2022·33 cites

Neurotoxin: Durable Backdoors in Federated Learning

Zhengming Zhang, Ashwinee Panda, Linyue Song, Yaoqing Yang, Michael W., Mahoney, Joseph E. Gonzalez, Kannan Ramchandran, Prateek Mittal

PDF

Open Access 2 Repos

TL;DR

This paper introduces Neurotoxin, a modification to backdoor attacks in federated learning that significantly increases the durability of backdoors across various tasks, posing a stronger threat to FL systems.

Contribution

Neurotoxin is a simple one-line modification that enhances the durability of backdoors in federated learning models, making them persist longer during training.

Findings

01

Doubles the durability of state-of-the-art backdoors

02

Effective across ten NLP and computer vision tasks

03

Increases threat level of backdoor attacks in FL systems

Abstract

Due to their decentralized nature, federated learning (FL) systems have an inherent vulnerability during their training to adversarial backdoor attacks. In this type of attack, the goal of the attacker is to use poisoned updates to implant so-called backdoors into the learned model such that, at test time, the model's outputs can be fixed to a given target for certain inputs. (As a simple toy example, if a user types "people from New York" into a mobile keyboard app that uses a backdoored next word prediction model, then the model could autocomplete the sentence to "people from New York are rude"). Prior work has shown that backdoors can be inserted into FL models, but these backdoors are often not durable, i.e., they do not remain in the model after the attacker stops uploading poisoned updates. Thus, since training typically continues progressively in production FL systems, an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data

MethodsTest