Defending Your Voice: Adversarial Attack on Voice Conversion
Chien-yu Huang, Yist Y. Lin, Hung-yi Lee, Lin-shan Lee

TL;DR
This paper introduces the first adversarial attack method on voice conversion systems, using imperceptible noise to prevent voice impersonation, with promising results on state-of-the-art models.
Contribution
It presents a novel adversarial attack approach on voice conversion, enhancing privacy protection by making converted voices distinguishably different.
Findings
Adversarial noise effectively disrupts voice conversion.
Converted utterances differ significantly from the defended speaker.
Adversarial examples are indistinguishable from authentic speech.
Abstract
Substantial improvements have been achieved in recent years in voice conversion, which converts the speaker characteristics of an utterance into those of another speaker without changing the linguistic content of the utterance. Nonetheless, the improved conversion technologies also led to concerns about privacy and authentication. It thus becomes highly desired to be able to prevent one's voice from being improperly utilized with such voice conversion technologies. This is why we report in this paper the first known attempt to perform adversarial attack on voice conversion. We introduce human imperceptible noise into the utterances of a speaker whose voice is to be defended. Given these adversarial examples, voice conversion models cannot convert other utterances so as to sound like being produced by the defended speaker. Preliminary experiments were conducted on two currently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Adversarial Robustness in Machine Learning · Hate Speech and Cyberbullying Detection
