Towards End-to-End Private Automatic Speaker Recognition
Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso

TL;DR
This paper proposes a method for private automatic speaker recognition that allows speaker embeddings to be extracted securely using Secure Multiparty Computation, addressing privacy and security concerns in voice authentication systems.
Contribution
It introduces a novel approach to extract speaker embeddings privately, maintaining both user voice privacy and model confidentiality using Secure Multiparty Computation.
Findings
Secure embedding extraction is feasible with reasonable trade-offs.
The method enhances security by preventing model knowledge leakage.
It advances towards fully private speaker recognition systems.
Abstract
The development of privacy-preserving automatic speaker verification systems has been the focus of a number of studies with the intent of allowing users to authenticate themselves without risking the privacy of their voice. However, current privacy-preserving methods assume that the template voice representations (or speaker embeddings) used for authentication are extracted locally by the user. This poses two important issues: first, knowledge of the speaker embedding extraction model may create security and robustness liabilities for the authentication system, as this knowledge might help attackers in crafting adversarial examples able to mislead the system; second, from the point of view of a service provider the speaker embedding extraction model is arguably one of the most valuable components in the system and, as such, disclosing it would be highly undesirable. In this work, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james
