Non-native English lexicon creation for bilingual speech synthesis

Arun Baby; Pranav Jawale; Saranya Vinnaitherthan; Sumukh Badam,; Nagaraj Adiga; Sharath Adavanne

arXiv:2106.10870·eess.AS·June 22, 2021

Non-native English lexicon creation for bilingual speech synthesis

Arun Baby, Pranav Jawale, Saranya Vinnaitherthan, Sumukh Badam,, Nagaraj Adiga, Sharath Adavanne

PDF

TL;DR

This paper introduces a method to create non-native English lexicons for bilingual TTS systems by mapping native lexicons using letter-to-phoneme alignment, improving speech intelligibility for non-native speakers.

Contribution

It proposes a novel rule-based approach to generate non-native English lexicons from native lexicons, enhancing bilingual TTS performance.

Findings

01

6% preference improvement in subjective evaluation

02

Effective mapping of native to non-native phoneme sequences

03

Enhanced speech intelligibility for non-native English speakers

Abstract

Bilingual English speakers speak English as one of their languages. Their English is of a non-native kind, and their conversations are of a code-mixed fashion. The intelligibility of a bilingual text-to-speech (TTS) system for such non-native English speakers depends on a lexicon that captures the phoneme sequence used by non-native speakers. However, due to the lack of non-native English lexicon, existing bilingual TTS systems employ native English lexicons that are widely available, in addition to their native language lexicon. Due to the inconsistency between the non-native English pronunciation in the audio and native English lexicon in the text, the intelligibility of synthesized speech in such TTS systems is significantly reduced. This paper is motivated by the knowledge that the native language of the speaker highly influences non-native English pronunciation. We propose a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.