Improving on-device speaker verification using federated learning with privacy
Filip Granqvist, Matt Seigel, Rogier van Dalen, \'Aine Cahill, Stephen, Shum, Matthias Paulik

TL;DR
This paper demonstrates that privacy-preserving federated learning can enhance on-device speaker verification by utilizing auxiliary models to predict speaker characteristics, leading to a 6% EER improvement.
Contribution
It introduces a federated learning approach with differential privacy for training auxiliary models that improve speaker verification accuracy without compromising user privacy.
Findings
Achieved a 6% relative reduction in equal error rate.
Enabled large-scale training with privacy guarantees.
Improved speaker verification accuracy using auxiliary side information.
Abstract
Information on speaker characteristics can be useful as side information in improving speaker recognition accuracy. However, such information is often private. This paper investigates how privacy-preserving learning can improve a speaker verification system, by enabling the use of privacy-sensitive speaker data to train an auxiliary classification model that predicts vocal characteristics of speakers. In particular, this paper explores the utility achieved by approaches which combine different federated learning and differential privacy mechanisms. These approaches make it possible to train a central model while protecting user privacy, with users' data remaining on their devices. Furthermore, they make learning on a large population of speakers possible, ensuring good coverage of speaker characteristics when training a model. The auxiliary model described here uses features extracted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
