DeepSight: Mitigating Backdoor Attacks in Federated Learning Through   Deep Model Inspection

Phillip Rieger; Thien Duc Nguyen; Markus Miettinen; Ahmad-Reza Sadeghi

arXiv:2201.00763·cs.CR·January 4, 2022

DeepSight: Mitigating Backdoor Attacks in Federated Learning Through Deep Model Inspection

Phillip Rieger, Thien Duc Nguyen, Markus Miettinen, Ahmad-Reza Sadeghi

PDF

TL;DR

DeepSight is a novel approach that effectively detects and mitigates backdoor attacks in federated learning by analyzing model updates' internal structures and data distributions, outperforming existing defenses with minimal impact on benign models.

Contribution

DeepSight introduces a new model filtering method based on data distribution and internal structure analysis to identify and eliminate poisoned model updates in federated learning.

Findings

01

Successfully mitigates state-of-the-art backdoor attacks

02

Maintains high model performance on benign data

03

Outperforms existing defenses in detection accuracy

Abstract

Federated Learning (FL) allows multiple clients to collaboratively train a Neural Network (NN) model on their private data without revealing the data. Recently, several targeted poisoning attacks against FL have been introduced. These attacks inject a backdoor into the resulting model that allows adversary-controlled inputs to be misclassified. Existing countermeasures against backdoor attacks are inefficient and often merely aim to exclude deviating models from the aggregation. However, this approach also removes benign models of clients with deviating data distributions, causing the aggregated model to perform poorly for such clients. To address this problem, we propose DeepSight, a novel model filtering approach for mitigating backdoor attacks. It is based on three novel techniques that allow to characterize the distribution of data used to train model updates and seek to measure…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.