Federated Domain Adaptation for ASR with Full Self-Supervision
Junteng Jia, Jay Mahadeokar, Weiyi Zheng, Yuan Shangguan, Ozlem, Kalinli, Frank Seide

TL;DR
This paper introduces a federated learning approach for on-device speech recognition that leverages self-supervision and data augmentation to improve performance without ground-truth transcriptions, reducing costs and preserving privacy.
Contribution
It proposes a novel federated domain adaptation system for ASR using full self-supervision and a cost-effective training method with selective weight adaptation.
Findings
Achieved 12.9% relative WER reduction over out-of-domain baseline
Reduced training cost with self-restricted RNN-T loss and partial weight adaptation
Attained 70% of the performance gain of fully supervised centralized training
Abstract
Cross-device federated learning (FL) protects user privacy by collaboratively training a model on user devices, therefore eliminating the need for collecting, storing, and manually labeling user data. While important topics such as the FL training algorithm, non-IID-ness, and Differential Privacy have been well studied in the literature, this paper focuses on two challenges of practical importance for improving on-device ASR: the lack of ground-truth transcriptions and the scarcity of compute resource and network bandwidth on edge devices. First, we propose a FL system for on-device ASR domain adaptation with full self-supervision, which uses self-labeling together with data augmentation and filtering techniques. The system can improve a strong Emformer-Transducer based ASR model pretrained on out-of-domain data, using in-domain audio without any ground-truth transcriptions. Second, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
