Subject Data Auditing via Source Inference Attack in Cross-Silo Federated Learning
Jiaxin Li, Marco Arazzi, Antonino Nocera, Mauro Conti

TL;DR
This paper introduces a novel subject-level source inference attack in cross-silo federated learning, significantly improving detection accuracy of client data usage from specific subjects, and proposes privacy defenses.
Contribution
It presents the first subject-level source inference attack removing previous constraints, enhancing attack accuracy, and explores privacy-preserving defenses in federated learning.
Findings
SLSIA achieves up to 0.88 accuracy on 50 subjects.
Datasets with sparse subjects are more vulnerable.
Proposed defenses include item-level and subject-level differential privacy.
Abstract
Source Inference Attack (SIA) in Federated Learning (FL) aims to identify which client used a target data point for local model training. It allows the central server to audit clients' data usage. In cross-silo FL, a client (silo) collects data from multiple subjects (e.g., individuals, writers, or devices), posing a risk of subject information leakage. Subject Membership Inference Attack (SMIA) targets this scenario and attempts to infer whether any client utilizes data points from a target subject in cross-silo FL. However, existing results on SMIA are limited and based on strong assumptions on the attack scenario. Therefore, we propose a Subject-Level Source Inference Attack (SLSIA) by removing critical constraints that only one client can use a target data point in SIA and imprecise detection of clients utilizing target subject data in SMIA. The attacker, positioned on the server…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Adversarial Robustness in Machine Learning
MethodsSparse Evolutionary Training
