Improved Meta Learning for Low Resource Speech Recognition

Satwinder Singh; Ruili Wang; Feng Hou

arXiv:2205.06182·cs.CL·May 13, 2022

Improved Meta Learning for Low Resource Speech Recognition

Satwinder Singh, Ruili Wang, Feng Hou

PDF

TL;DR

This paper introduces an enhanced meta learning framework for low resource speech recognition that addresses MAML's training issues by incorporating multi-step loss, resulting in more stable training and better accuracy across languages.

Contribution

The paper proposes a multi-step loss approach to improve MAML's stability and performance in low resource speech recognition tasks.

Findings

01

Significantly improved training stability.

02

Outperforms MAML in character error rates.

03

Effective across multiple languages.

Abstract

We propose a new meta learning based framework for low resource speech recognition that improves the previous model agnostic meta learning (MAML) approach. The MAML is a simple yet powerful meta learning approach. However, the MAML presents some core deficiencies such as training instabilities and slower convergence speed. To address these issues, we adopt multi-step loss (MSL). The MSL aims to calculate losses at every step of the inner loop of MAML and then combines them with a weighted importance vector. The importance vector ensures that the loss at the last step has more importance than the previous steps. Our empirical evaluation shows that MSL significantly improves the stability of the training procedure and it thus also improves the accuracy of the overall system. Our proposed system outperforms MAML based low resource ASR system on various languages in terms of character error…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsModel-Agnostic Meta-Learning