Everyday Speech in the Indian Subcontinent

Utkarsh P

arXiv:2410.10508·cs.CL·February 24, 2025

Open Access

TL;DR

This paper introduces a phonetics-based Common Label Set (CLS) for multilingual speech synthesis in India, enabling seamless code switching among 13 languages and English, reflecting everyday multilingual speech.

Contribution

It proposes a novel CLS approach that simplifies multilingual synthesis and supports code switching in Indian languages within an E2E framework.

Findings

01. Enables seamless code switching across 13 Indian languages and English.

02. Reduces vocabulary complexity in multilingual speech synthesis.

03. Supports natural everyday speech in multilingual Indian contexts.

Abstract

India has 1369 languages, of which 22 are official. About 13 different scripts are used to represent these languages. A phonetics-based Common Label Set (CLS) was developed to address the large vocabulary of units required by the End-to-End (E2E) framework for multilingual synthesis. Indian-language text is first converted to CLS. This approach enables seamless code switching across 13 Indian languages and English in a given native speaker's voice, which corresponds to everyday speech in the Indian subcontinent, where the population is multilingual.
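
To make the text-to-CLS step concrete, here is a minimal sketch of the general idea: characters from different scripts that represent the same sound are mapped to one shared label, so the synthesizer's unit vocabulary stays small. The table `TOY_CLS`, the function `to_cls`, and the label names are assumptions made for illustration; they are not the paper's actual CLS inventory or conversion rules.

```python
# Toy illustration of a shared phonetic label set across two scripts.
# TOY_CLS and its label names are hypothetical, not the paper's real CLS.
TOY_CLS = {
    "\u0915": "ka",  # Devanagari letter KA
    "\u092e": "ma",  # Devanagari letter MA
    "\u0932": "la",  # Devanagari letter LA
    "\u0b95": "ka",  # Tamil letter KA
    "\u0bae": "ma",  # Tamil letter MA
    "\u0bb2": "la",  # Tamil letter LA
}


def to_cls(text):
    """Map each known character to its shared label; pass unknown characters through."""
    return [TOY_CLS.get(ch, ch) for ch in text]


devanagari_word = "\u0915\u092e\u0932"  # KA, MA, LA in Devanagari
tamil_word = "\u0b95\u0bae\u0bb2"       # KA, MA, LA in Tamil

devanagari_labels = to_cls(devanagari_word)
tamil_labels = to_cls(tamil_word)

# Six distinct script characters collapse to three shared labels.
script_vocab = set(TOY_CLS)
label_vocab = set(TOY_CLS.values())
```

In this toy table both words yield the same label sequence, which is why a single model can, in principle, read mixed-script input without a separate unit inventory per script. A real conversion would also need to handle vowel signs, conjuncts, and language-specific pronunciation rules, which this sketch ignores.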

Taxonomy

Topics: Language, Discourse, Communication Strategies · Multilingual Education and Policy · South Asian Studies and Conflicts

Methods: Sparse Evolutionary Training