An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
Spandan Dey, Md Sahidullah, Goutam Saha

TL;DR
This paper provides a comprehensive review of Indian spoken language recognition research, highlighting unique challenges, available resources, and recent advances in neural network approaches to improve multilingual voice identification systems.
Contribution
It offers one of the first detailed analyses of Indian LID research, covering speech corpora, methodologies, challenges, and future trends in a single comprehensive review.
Findings
Significant progress with neural network architectures
Availability of diverse speech corpora for Indian languages
Identification of key challenges like low-resource languages
Abstract
Automatic spoken language identification (LID) is a very important research field in the era of multilingual voice-command-based human-computer interaction (HCI). A front-end LID module helps to improve the performance of many speech-based applications in the multilingual scenario. India is a populous country with diverse cultures and languages. The majority of the Indian population needs to use their respective native languages for verbal interaction with machines. Therefore, the development of efficient Indian spoken language recognition systems is useful for adapting smart technologies in every section of Indian society. The field of Indian LID has started gaining momentum in the last two decades, mainly due to the development of several standard multilingual speech corpora for the Indian languages. Even though significant research progress has already been made in this field, to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
