Structural Analysis of Hindi Phonetics and A Method for Extraction of   Phonetically Rich Sentences from a Very Large Hindi Text Corpus

Shrikant Malviya; Rohit Mishra; Uma Shanker Tiwary

arXiv:1701.08655·cs.CL·February 8, 2017

Structural Analysis of Hindi Phonetics and A Method for Extraction of Phonetically Rich Sentences from a Very Large Hindi Text Corpus

Shrikant Malviya, Rohit Mishra, Uma Shanker Tiwary

PDF

TL;DR

This paper analyzes Hindi phonetics and proposes a two-stage algorithm to extract a minimal set of phonetically rich sentences from a large corpus, improving speech recognition and synthesis training.

Contribution

It introduces a statistical analysis of Hindi phonetics and a novel two-stage selection algorithm for phonetically rich sentences with high triphone diversity.

Findings

01

Efficiently extracts a minimal, diverse set of sentences covering all phonetic units.

02

Improves triphone distribution uniformity in the selected sentence set.

03

Reduces the number of sentences needed for effective speech module training.

Abstract

Automatic speech recognition (ASR) and Text to speech (TTS) are two prominent area of research in human computer interaction nowadays. A set of phonetically rich sentences is in a matter of importance in order to develop these two interactive modules of HCI. Essentially, the set of phonetically rich sentences has to cover all possible phone units distributed uniformly. Selecting such a set from a big corpus with maintaining phonetic characteristic based similarity is still a challenging problem. The major objective of this paper is to devise a criteria in order to select a set of sentences encompassing all phonetic aspects of a corpus with size as minimum as possible. First, this paper presents a statistical analysis of Hindi phonetics by observing the structural characteristics. Further a two stage algorithm is proposed to extract phonetically rich sentences with a high variety of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.