Improved Meta Learning for Low Resource Speech Recognition
Satwinder Singh, Ruili Wang, Feng Hou

TL;DR
This paper introduces an enhanced meta learning framework for low resource speech recognition that addresses MAML's training issues by incorporating multi-step loss, resulting in more stable training and better accuracy across languages.
Contribution
The paper proposes a multi-step loss approach to improve MAML's stability and performance in low resource speech recognition tasks.
Findings
Significantly improved training stability.
Outperforms MAML in character error rates.
Effective across multiple languages.
Abstract
We propose a new meta learning based framework for low resource speech recognition that improves the previous model agnostic meta learning (MAML) approach. The MAML is a simple yet powerful meta learning approach. However, the MAML presents some core deficiencies such as training instabilities and slower convergence speed. To address these issues, we adopt multi-step loss (MSL). The MSL aims to calculate losses at every step of the inner loop of MAML and then combines them with a weighted importance vector. The importance vector ensures that the loss at the last step has more importance than the previous steps. Our empirical evaluation shows that MSL significantly improves the stability of the training procedure and it thus also improves the accuracy of the overall system. Our proposed system outperforms MAML based low resource ASR system on various languages in terms of character error…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsModel-Agnostic Meta-Learning
