Study of Indian English Pronunciation Variabilities relative to Received Pronunciation
Priyanshi Pal, Shelly Jain, Anil Vuppala, Chiranjeevi Yarra, Prasanta, Ghosh

TL;DR
This paper analyzes Indian English pronunciation variabilities relative to Received Pronunciation using a large phonetic dataset, deriving data-driven phonetic rules and validating them through G2P conversion performance.
Contribution
It introduces a data-driven approach to characterize Indian English pronunciation variabilities and derives new phonetic rules from a large corpus, addressing limitations of qualitative analyses.
Findings
Derived new phonetic rules for Indian English pronunciation variabilities.
Validated rules improve Grapheme-to-Phoneme conversion accuracy.
Provided a comprehensive phonetic analysis of diverse Indian English speakers.
Abstract
Analysis of Indian English (IE) pronunciation variabilities are useful in building systems for Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) synthesis in the Indian context. Typically, these pronunciation variabilities have been explored by comparing IE pronunciation with Received Pronunciation (RP). However, to explore these variabilities, it is required to have labelled pronunciation data at the phonetic level, which is scarce for IE. Moreover, versatility of IE stems from the influence of a large diversity of the speakers' mother tongues and demographic region differences. Prior linguistic works have characterised features of IE variabilities qualitatively by reporting phonetic rules that represent such variations relative to RP. The qualitative descriptions often lack quantitative descriptors and data-driven analysis of diverse IE pronunciation data to characterise IE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Speech and Audio Processing
