Subject Data Auditing via Source Inference Attack in Cross-Silo   Federated Learning

Jiaxin Li; Marco Arazzi; Antonino Nocera; Mauro Conti

arXiv:2409.19417·cs.CR·October 1, 2024

Subject Data Auditing via Source Inference Attack in Cross-Silo Federated Learning

Jiaxin Li, Marco Arazzi, Antonino Nocera, Mauro Conti

PDF

Open Access

TL;DR

This paper introduces a novel subject-level source inference attack in cross-silo federated learning, significantly improving detection accuracy of client data usage from specific subjects, and proposes privacy defenses.

Contribution

It presents the first subject-level source inference attack removing previous constraints, enhancing attack accuracy, and explores privacy-preserving defenses in federated learning.

Findings

01

SLSIA achieves up to 0.88 accuracy on 50 subjects.

02

Datasets with sparse subjects are more vulnerable.

03

Proposed defenses include item-level and subject-level differential privacy.

Abstract

Source Inference Attack (SIA) in Federated Learning (FL) aims to identify which client used a target data point for local model training. It allows the central server to audit clients' data usage. In cross-silo FL, a client (silo) collects data from multiple subjects (e.g., individuals, writers, or devices), posing a risk of subject information leakage. Subject Membership Inference Attack (SMIA) targets this scenario and attempts to infer whether any client utilizes data points from a target subject in cross-silo FL. However, existing results on SMIA are limited and based on strong assumptions on the attack scenario. Therefore, we propose a Subject-Level Source Inference Attack (SLSIA) by removing critical constraints that only one client can use a target data point in SIA and imprecise detection of clients utilizing target subject data in SMIA. The attacker, positioned on the server…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Adversarial Robustness in Machine Learning

MethodsSparse Evolutionary Training