Semi-supervised and Active-learning Scenarios: Efficient Acoustic Model   Refinement for a Low Resource Indian Language

Maharajan Chellapriyadharshini; Anoop Toffy; Srinivasa Raghavan K. M.,; V Ramasubramanian

arXiv:1810.06635·cs.CL·October 17, 2018

Semi-supervised and Active-learning Scenarios: Efficient Acoustic Model Refinement for a Low Resource Indian Language

Maharajan Chellapriyadharshini, Anoop Toffy, Srinivasa Raghavan K. M.,, V Ramasubramanian

PDF

TL;DR

This paper presents semi-supervised and active learning methods for efficient acoustic model refinement in low-resource Indian languages, significantly reducing manual labeling efforts while improving speech recognition accuracy.

Contribution

It introduces protocols that leverage confidence-based decoding and iterative bootstrapping to enhance acoustic models with minimal manual labeling.

Findings

01

Semi-supervised learning reduces WER by up to 50%.

02

Active learning achieves comparable performance with only 60% manual labeling.

03

Protocols enable effective model refinement in low-resource settings.

Abstract

We address the problem of efficient acoustic-model refinement (continuous retraining) using semi-supervised and active learning for a low resource Indian language, wherein the low resource constraints are having i) a small labeled corpus from which to train a baseline `seed' acoustic model and ii) a large training corpus without orthographic labeling or from which to perform a data selection for manual labeling at low costs. The proposed semi-supervised learning decodes the unlabeled large training corpus using the seed model and through various protocols, selects the decoded utterances with high reliability using confidence levels (that correlate to the WER of the decoded utterances) and iterative bootstrapping. The proposed active learning protocol uses confidence level based metric to select the decoded utterances from the large unlabeled corpus for further labeling. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.