TTS-Guided Training for Accent Conversion Without Parallel Data
Yi Zhou, Zhizheng Wu, Mingyang Zhang, Xiaohai Tian, Haizhou Li

TL;DR
This paper introduces a non-parallel accent conversion method using a pretrained TTS system to transform speech accents while maintaining content and speaker identity, validated through experiments on English accents.
Contribution
A novel non-parallel accent conversion framework leveraging a pretrained TTS model and speech encoder to effectively change accents without parallel data.
Findings
High-quality accent conversion achieved without parallel data.
Objective and subjective tests confirm the effectiveness of the approach.
Successful conversion of Chinese and Indian accents to American, British, and Canadian accents.
Abstract
Accent Conversion (AC) seeks to change the accent of speech from one (source) to another (target) while preserving the speech content and speaker identity. However, many AC approaches rely on source-target parallel speech data. We propose a novel accent conversion framework without the need of parallel data. Specifically, a text-to-speech (TTS) system is first pretrained with target-accented speech data. This TTS model and its hidden representations are expected to be associated only with the target accent. Then, a speech encoder is trained to convert the accent of the speech under the supervision of the pretrained TTS model. In doing so, the source-accented speech and its corresponding transcription are forwarded to the speech encoder and the pretrained TTS, respectively. The output of the speech encoder is optimized to be the same as the text embedding in the TTS system. At run-time,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Phonetics and Phonology Research
