ILASR: Privacy-Preserving Incremental Learning for Automatic Speech   Recognition at Production Scale

Gopinath Chennupati; Milind Rao; Gurpreet Chadha; Aaron Eakin; Anirudh; Raju; Gautam Tiwari; Anit Kumar Sahu; Ariya Rastrow; Jasha Droppo; Andy; Oberlin; Buddha Nandanoor; Prahalad Venkataramanan; Zheng Wu; Pankaj Sitpure

arXiv:2207.09078·cs.CL·July 26, 2022

ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale

Gopinath Chennupati, Milind Rao, Gurpreet Chadha, Aaron Eakin, Anirudh, Raju, Gautam Tiwari, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo, Andy, Oberlin, Buddha Nandanoor, Prahalad Venkataramanan, Zheng Wu, Pankaj Sitpure

PDF

TL;DR

This paper presents ILASR, a privacy-preserving incremental learning system for end-to-end speech recognition that improves model performance over time without human annotations, suitable for production environments.

Contribution

It introduces a cloud-based framework for privacy-preserving incremental ASR learning, demonstrating significant model improvements with weak supervision and large batch sizes.

Findings

01

Achieved 3% improvement over six months without human labels

02

Enhanced recognition of new words and phrases by 20%

03

Validated effectiveness of large batch sizes and teacher models

Abstract

Incremental learning is one paradigm to enable model building and updating at scale with streaming data. For end-to-end automatic speech recognition (ASR) tasks, the absence of human annotated labels along with the need for privacy preserving policies for model building makes it a daunting challenge. Motivated by these challenges, in this paper we use a cloud based framework for production systems to demonstrate insights from privacy preserving incremental learning for automatic speech recognition (ILASR). By privacy preserving, we mean, usage of ephemeral data which are not human annotated. This system is a step forward for production levelASR models for incremental/continual learning that offers near real-time test-bed for experimentation in the cloud for end-to-end ASR, while adhering to privacy-preserving policies. We show that the proposed system can improve the production models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTest