Hidden Data Privacy Breaches in Federated Learning

Xueluan Gong; Yuji Wang; Shuaike Li; Mengyuan Sun; Songze Li; Qian; Wang; Kwok-Yan Lam; and Chen Chen

arXiv:2411.18269·cs.CL·November 28, 2024

Hidden Data Privacy Breaches in Federated Learning

Xueluan Gong, Yuji Wang, Shuaike Li, Mengyuan Sun, Songze Li, Qian, Wang, Kwok-Yan Lam, and Chen Chen

PDF

Open Access

TL;DR

This paper introduces a stealthy data-reconstruction attack on federated learning that can extract high-resolution sensitive data without detection, highlighting critical privacy vulnerabilities in current FL systems.

Contribution

The authors propose a novel, undetectable attack leveraging malicious code injection, sparse encoding, and block partitioning to reconstruct sensitive data in federated learning.

Findings

01

Outperforms five state-of-the-art attacks in detection scenarios

02

Effective on large-scale, high-resolution datasets

03

Applicable to both FedAVG and FedSGD scenarios

Abstract

Federated Learning (FL) emerged as a paradigm for conducting machine learning across broad and decentralized datasets, promising enhanced privacy by obviating the need for direct data sharing. However, recent studies show that attackers can steal private data through model manipulation or gradient analysis. Existing attacks are constrained by low theft quantity or low-resolution data, and they are often detected through anomaly monitoring in gradients or weights. In this paper, we propose a novel data-reconstruction attack leveraging malicious code injection, supported by two key techniques, i.e., distinctive and sparse encoding design and block partitioning. Unlike conventional methods that require detectable changes to the model, our method stealthily embeds a hidden model using parameter sharing to systematically extract sensitive data. The Fibonacci-based index design ensures…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Adversarial Robustness in Machine Learning