Fingerprint Attack: Client De-Anonymization in Federated Learning

Qiongkai Xu; Trevor Cohn; Olga Ohrimenko

arXiv:2310.05960·cs.CR·October 11, 2023

Fingerprint Attack: Client De-Anonymization in Federated Learning

Qiongkai Xu, Trevor Cohn, Olga Ohrimenko

PDF

Open Access 1 Repo

TL;DR

This paper introduces a fingerprinting attack on federated learning gradients that can de-anonymize participants, and demonstrates that differential privacy can effectively defend against this attack.

Contribution

It presents a novel fingerprinting attack on federated learning gradients and evaluates its effectiveness, proposing differential privacy as a practical countermeasure.

Findings

01

Gradient clustering can de-anonymize participants.

02

Differential privacy mitigates fingerprinting attack.

03

Empirical validation on language models.

Abstract

Federated Learning allows collaborative training without data sharing in settings where participants do not trust the central server and one another. Privacy can be further improved by ensuring that communication between the participants and the server is anonymized through a shuffle; decoupling the participant identity from their data. This paper seeks to examine whether such a defense is adequate to guarantee anonymity, by proposing a novel fingerprinting attack over gradients sent by the participants to the server. We show that clustering of gradients can easily break the anonymization in an empirical study of learning federated language models on two language corpora. We then show that training with differential privacy can provide a practical defense against our fingerprint attack.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xuqiongkai/fingerprintattack_on_fl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Hate Speech and Cyberbullying Detection