A Novel Defense Against Poisoning Attacks on Federated Learning: LayerCAM Augmented with Autoencoder
Jingjing Zheng, Xin Yuan, Kai Li, Wei Ni, Eduardo Tovar, Jon, Crowcroft

TL;DR
This paper introduces LayerCAM-AE, a novel defense mechanism for federated learning that combines Layer Class Activation Mapping with an autoencoder to detect and mitigate model poisoning attacks effectively.
Contribution
The paper proposes a new LayerCAM-AE method integrating LayerCAM and autoencoder for improved detection of malicious updates in federated learning.
Findings
LayerCAM-AE achieves perfect detection metrics (Recall, Precision, F1, AUC) in experiments.
It outperforms existing defenses like ResNet-50 and REGNETY-800MF.
The method is effective under both IID and non-IID data distributions.
Abstract
Recent attacks on federated learning (FL) can introduce malicious model updates that circumvent widely adopted Euclidean distance-based detection methods. This paper proposes a novel defense strategy, referred to as LayerCAM-AE, designed to counteract model poisoning in federated learning. The LayerCAM-AE puts forth a new Layer Class Activation Mapping (LayerCAM) integrated with an autoencoder (AE), significantly enhancing detection capabilities. Specifically, LayerCAM-AE generates a heat map for each local model update, which is then transformed into a more compact visual format. The autoencoder is designed to process the LayerCAM heat maps from the local model updates, improving their distinctiveness and thereby increasing the accuracy in spotting anomalous maps and malicious local models. To address the risk of misclassifications with LayerCAM-AE, a voting algorithm is developed,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Adversarial Robustness in Machine Learning
