Wakeword Detection under Distribution Shifts
Sree Hari Krishnan Parthasarathi, Lu Zeng, Christin Jose, Joseph Wang

TL;DR
This paper introduces a semi-supervised learning method for wakeword detection that effectively handles distribution shifts between training and deployment data, improving false discovery rates significantly.
Contribution
It proposes a novel teacher/student training framework with confidence-based labeling and label distribution matching to address distribution shifts in keyword spotting.
Findings
14.3% relative FDR reduction under distribution shift
5% FDR improvement without shift
52% relative FDR reduction under severe shift
Abstract
We propose a novel approach for semi-supervised learning (SSL) designed to overcome distribution shifts between training and real-world data arising in the keyword spotting (KWS) task. Shifts from training data distribution are a key challenge for real-world KWS tasks: when a new model is deployed on device, the gating of the accepted data undergoes a shift in distribution, making the problem of timely updates via subsequent deployments hard. Despite the shift, we assume that the marginal distributions on labels do not change. We utilize a modified teacher/student training framework, where labeled training data is augmented with unlabeled data. Note that the teacher does not have access to the new distribution as well. To train effectively with a mix of human and teacher labeled data, we develop a teacher labeling strategy based on confidence heuristics to reduce entropy on the label…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music and Audio Processing · Speech Recognition and Synthesis
