Generic Indic Text-to-speech Synthesisers with Rapid Adaptation in an   End-to-end Framework

Anusha Prakash; Hema A Murthy

arXiv:2006.06971·eess.AS·November 1, 2022·Interspeech

Generic Indic Text-to-speech Synthesisers with Rapid Adaptation in an End-to-end Framework

Anusha Prakash, Hema A Murthy

PDF

TL;DR

This paper presents a generic end-to-end TTS framework for Indian languages that leverages language family similarities, enabling rapid adaptation to new languages with minimal data and preserving speaker prosody.

Contribution

The work introduces a novel multi-language generic TTS system exploiting language family properties, with effective adaptation to new languages using only 7 minutes of data.

Findings

01

High-quality TTS with 3.98 MOS after adaptation

02

Effective language and speaker switching capabilities

03

Rapid adaptation with minimal data (7 minutes)

Abstract

Building text-to-speech (TTS) synthesisers for Indian languages is a difficult task owing to a large number of active languages. Indian languages can be classified into a finite set of families, prominent among them, Indo-Aryan and Dravidian. The proposed work exploits this property to build a generic TTS system using multiple languages from the same family in an end-to-end framework. Generic systems are quite robust as they are capable of capturing a variety of phonotactics across languages. These systems are then adapted to a new language in the same family using small amounts of adaptation data. Experiments indicate that good quality TTS systems can be built using only 7 minutes of adaptation data. An average degradation mean opinion score of 3.98 is obtained for the adapted TTSes. Extensive analysis of systematic interactions between languages in the generic TTSes is carried out.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.