An Overview of Indian Spoken Language Recognition from Machine Learning   Perspective

Spandan Dey; Md Sahidullah; Goutam Saha

arXiv:2212.03812·cs.CL·December 8, 2022

An Overview of Indian Spoken Language Recognition from Machine Learning Perspective

Spandan Dey, Md Sahidullah, Goutam Saha

PDF

TL;DR

This paper provides a comprehensive review of Indian spoken language recognition research, highlighting unique challenges, available resources, and recent advances in neural network approaches to improve multilingual voice identification systems.

Contribution

It offers one of the first detailed analyses of Indian LID research, covering speech corpora, methodologies, challenges, and future trends in a single comprehensive review.

Findings

01

Significant progress with neural network architectures

02

Availability of diverse speech corpora for Indian languages

03

Identification of key challenges like low-resource languages

Abstract

Automatic spoken language identification (LID) is a very important research field in the era of multilingual voice-command-based human-computer interaction (HCI). A front-end LID module helps to improve the performance of many speech-based applications in the multilingual scenario. India is a populous country with diverse cultures and languages. The majority of the Indian population needs to use their respective native languages for verbal interaction with machines. Therefore, the development of efficient Indian spoken language recognition systems is useful for adapting smart technologies in every section of Indian society. The field of Indian LID has started gaining momentum in the last two decades, mainly due to the development of several standard multilingual speech corpora for the Indian languages. Even though significant research progress has already been made in this field, to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.