Online Continual Learning of End-to-End Speech Recognition Models

Muqiao Yang; Ian Lane; Shinji Watanabe

arXiv:2207.05071·cs.LG·July 13, 2022

Online Continual Learning of End-to-End Speech Recognition Models

Muqiao Yang, Ian Lane, Shinji Watanabe

PDF

Open Access

TL;DR

This paper introduces an online continual learning approach for end-to-end speech recognition, enabling models to adapt incrementally with lower computational costs while maintaining high accuracy.

Contribution

It proposes an online continual learning framework with Gradient Episodic Memory for speech recognition, including a selective sampling strategy and validation with SSL features.

Findings

01

Incremental updates achieve similar accuracy to retraining from scratch.

02

The method reduces computational costs significantly.

03

Self-supervised learning features improve performance.

Abstract

Continual Learning, also known as Lifelong Learning, aims to continually learn from new data as it becomes available. While prior research on continual learning in automatic speech recognition has focused on the adaptation of models across multiple different speech recognition tasks, in this paper we propose an experimental setting for \textit{online continual learning} for automatic speech recognition of a single task. Specifically focusing on the case where additional training data for the same task becomes available incrementally over time, we demonstrate the effectiveness of performing incremental model updates to end-to-end speech recognition models with an online Gradient Episodic Memory (GEM) method. Moreover, we show that with online continual learning and a selective sampling strategy, we can maintain an accuracy that is similar to retraining a model from scratch while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning