Model Hijacking Attack in Federated Learning
Zheng Li, Siyuan Wu, Ruichuan Chen, Paarijaat Aditya, Istemi Ekin, Akkus, Manohar Vanga, Min Zhang, Hao Li, Yang Zhang

TL;DR
This paper introduces HijackFL, a novel attack method that hijacks federated learning models to perform malicious tasks without detection, using pixel-level perturbations based on local models.
Contribution
It is the first to extend model hijacking attacks to federated learning, demonstrating effective hijacking via pixel perturbations without data poisoning.
Findings
HijackFL outperforms baseline attacks in experiments.
The attack is effective across multiple datasets and models.
Potential defenses are discussed to mitigate hijacking risks.
Abstract
Machine learning (ML), driven by prominent paradigms such as centralized and federated learning, has made significant progress in various critical applications ranging from autonomous driving to face recognition. However, its remarkable success has been accompanied by various attacks. Recently, the model hijacking attack has shown that ML models can be hijacked to execute tasks different from their original tasks, which increases both accountability and parasitic computational risks. Nevertheless, thus far, this attack has only focused on centralized learning. In this work, we broaden the scope of this attack to the federated learning domain, where multiple clients collaboratively train a global model without sharing their data. Specifically, we present HijackFL, the first-of-its-kind hijacking attack against the global model in federated learning. The adversary aims to force the global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Network Security and Intrusion Detection
MethodsALIGN
