Source Inference Attacks: Beyond Membership Inference Attacks in Federated Learning
Hongsheng Hu, Xuyun Zhang, Zoran Salcic, Lichao Sun, Kim-Kwang Raymond, Choo, Gillian Dobbie

TL;DR
This paper introduces a novel source inference attack in federated learning, revealing that servers can identify the source client of training data, thus exposing privacy beyond traditional membership inference vulnerabilities.
Contribution
It proposes the first source inference attack (SIA) in federated learning, leveraging Bayesian methods for non-intrusive source identification, and evaluates its effectiveness across multiple frameworks and datasets.
Findings
SIAs successfully identify source clients in various FL frameworks.
Source information leaks occur through gradients, model parameters, and predictions.
Experimental results confirm the attack's high efficacy.
Abstract
Federated learning (FL) is a popular approach to facilitate privacy-aware machine learning since it allows multiple clients to collaboratively train a global model without granting others access to their private data. It is, however, known that FL can be vulnerable to membership inference attacks (MIAs), where the training records of the global model can be distinguished from the testing records. Surprisingly, research focusing on the investigation of the source inference problem appears to be lacking. We also observe that identifying a training record's source client can result in privacy breaches extending beyond MIAs. For example, consider an FL application where multiple hospitals jointly train a COVID-19 diagnosis model, membership inference attackers can identify the medical records that have been used for training, and any additional identification of the source hospital can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning
