On-Device Voice Authentication with Paralinguistic Privacy
Ranya Aloufi, Hamed Haddadi, David Boyle

TL;DR
This paper presents an on-device, privacy-preserving voice authentication system that uses token-based credentials and liveness detection to verify users while filtering sensitive paralinguistic information. It reports 98.68% user verification accuracy and runs in tens of milliseconds on standard hardware.
Contribution
It introduces a novel on-device voice authentication framework combining token-based credentials, liveness detection, and customizable privacy filters, enhancing privacy and security in voice-based systems.
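To make the combination of liveness detection, local verification, and token-based credentials more concrete, here is a minimal sketch. It is not the authors' implementation. The speaker embedding, the liveness score, both thresholds, the HMAC-signed token format, and all function names are assumptions made for illustration, and the privacy filter is left out. A shared-key HMAC is used only to keep the example within the standard library; a real deployment would more likely use an asymmetric signature so the service never holds the device key.

```python
import hashlib
import hmac
import json
import math
import time


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def issue_token(device_key, user_id, ttl_seconds=60, now=None):
    """Create a short-lived token signed with a key that stays on the device."""
    issued = int(time.time() if now is None else now)
    payload = json.dumps({"sub": user_id, "exp": issued + ttl_seconds},
                         sort_keys=True)
    tag = hmac.new(device_key, payload.encode(), hashlib.sha256).hexdigest()
    return payload + "." + tag


def verify_token(device_key, token, now=None):
    """Check the signature first, then the expiry time."""
    payload, _, tag = token.rpartition(".")
    expected = hmac.new(device_key, payload.encode(), hashlib.sha256).hexdigest()
    if not hmac.compare_digest(tag, expected):
        return False
    current = time.time() if now is None else now
    return json.loads(payload)["exp"] > current


def authenticate(embedding, enrolled, liveness_score, device_key, user_id,
                 sim_threshold=0.8, live_threshold=0.5, now=None):
    """Return a token only if the input looks live and matches the enrolled voice.

    The thresholds are hypothetical placeholders, not values from the paper.
    """
    if liveness_score < live_threshold:
        return None  # likely replayed or synthetic audio
    if cosine_similarity(embedding, enrolled) < sim_threshold:
        return None  # speaker does not match the enrolled user
    return issue_token(device_key, user_id, now=now)
```

In this sketch only the token leaves the device; the raw audio and the embedding stay local, which is the property the framework is aiming for.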
Findings
98.68% user verification accuracy
Runs in tens of milliseconds on standard hardware
Effectively filters raw voice data according to user privacy preferences
Abstract
Using our voices to access, and interact with, online services raises concerns about the trade-offs between convenience, privacy, and security. The conflict between maintaining privacy and ensuring input authenticity has often been hindered by the need to share raw data, which contains all the paralinguistic information required to infer a variety of sensitive characteristics. Users of voice assistants put their trust in service providers; however, this trust is potentially misplaced considering the emergence of first-party 'honest-but-curious' or 'semi-honest' threats. A further security risk is presented by imposters gaining access to systems by pretending to be the user leveraging replay or 'deepfake' attacks. Our objective is to design and develop a new voice input-based system that offers the following specifications: local authentication to reduce the need for sharing raw voice…
Taxonomy
Topics: User Authentication and Security Systems · Internet Traffic Analysis and Secure E-voting · Speech Recognition and Synthesis
