I'm Hearing (Different) Voices: Anonymous Voices to Protect User Privacy
Henry Turner, Giulio Lovisotto, Simon Eberz, Ivan Martinovic

TL;DR
AltVoice is a privacy-preserving system that re-synthesizes user voices into private identities without needing cooperation from remote voice services, enhancing user privacy and data revocability.
Contribution
The paper introduces AltVoice, a novel system that generates private voice identities from user speech, with six methods based on user secrets, addressing privacy and revocability issues.
Findings
Generated voices are hard to link to original users.
System effectively conceals user identity and protects against data leaks.
Further improvements needed for naturalness and distinctness of voices.
Abstract
In this paper, we present AltVoice -- a system designed to help user's protect their privacy when using remotely accessed voice services. The system allows a user to conceal their true voice identity information with no cooperation from the remote voice service: AltVoice re-synthesizes user's spoken audio to sound as if it has been spoken by a different, private identity. The system converts audio to its textual representation at its midpoint, and thus removes any linkage between the user's voice and the generated private voices. We implement AltVoice and we propose six different methods to generate private voice identities, each is based on a user-known secret. We identify the system's trade-offs, and we investigate them for each of the proposed identity generation methods. Specifically, we investigate generated voices' diversity, word error rate, perceived speech quality and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · User Authentication and Security Systems · Speech and Audio Processing
