Knowledge-driven Subword Grammar Modeling for Automatic Speech Recognition in Tamil and Kannada
Madhavaraj A, Bharathi Pilar, Ramakrishnan A G

TL;DR
This paper introduces a knowledge-driven subword grammar modeling approach for ASR in Tamil and Kannada, improving recognition accuracy by capturing complex word formations and using linguistic knowledge.
Contribution
It presents a novel subword grammar weighted finite state transducer framework combined with heuristic segmentation and linguistic knowledge for agglutinative languages.
Findings
Achieved 12.39% and 13.56% word error rate reduction in Tamil and Kannada ASR systems.
Developed subword dictionaries and SG-WFST graphs based on language-specific morphological rules.
Enhanced recognition of complex words through heuristic segmentation and linguistic knowledge integration.
Abstract
In this paper, we present specially designed automatic speech recognition (ASR) systems for the highly agglutinative and inflective languages of Tamil and Kannada that can recognize unlimited vocabulary of words. We use subwords as the basic lexical units for recognition and construct subword grammar weighted finite state transducer (SG-WFST) graphs for word segmentation that captures most of the complex word formation rules of the languages. We have identified the following category of words (i) verbs, (ii) nouns, (ii) pronouns, and (iv) numbers. The prefix, infix and suffix lists of subwords are created for each of these categories and are used to design the SG-WFST graphs. We also present a heuristic segmentation algorithm that can even segment exceptional words that do not follow the rules encapsulated in the SG-WFST graph. Most of the data-driven subword dictionary creation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Speech and dialogue systems
MethodsTest
