Wakeword Detection under Distribution Shifts

Sree Hari Krishnan Parthasarathi; Lu Zeng; Christin Jose; Joseph Wang

arXiv:2207.06423 · cs.SD · July 15, 2022

TL;DR

This paper introduces a semi-supervised learning method for wakeword detection that handles distribution shifts between training and deployment data, reducing the false discovery rate (FDR) by 14.3% relative under distribution shift and by 52% relative under severe shift.

Contribution

The paper proposes a teacher/student training framework that combines confidence-based teacher labeling with label distribution matching to address distribution shifts in keyword spotting.

Findings

01  14.3% relative FDR reduction under distribution shift

02  5% FDR improvement without shift

03  52% relative FDR reduction under severe shift
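
The findings above are stated as changes in false discovery rate (FDR). As a reminder of what a relative reduction means, here is a minimal sketch using the standard definition FDR = false accepts / (false accepts + true accepts). The function names and the counts are hypothetical, chosen only for illustration; they are not taken from the paper.

```python
def false_discovery_rate(false_accepts: int, true_accepts: int) -> float:
    """Fraction of accepted detections that are false: FP / (FP + TP)."""
    accepted = false_accepts + true_accepts
    if accepted == 0:
        return 0.0  # no detections accepted, so no false discoveries
    return false_accepts / accepted


def relative_reduction(baseline: float, candidate: float) -> float:
    """Relative improvement of candidate over baseline (positive means lower FDR)."""
    if baseline <= 0:
        raise ValueError("baseline FDR must be positive")
    return (baseline - candidate) / baseline


# Hypothetical counts, not results from the paper.
baseline_fdr = false_discovery_rate(false_accepts=20, true_accepts=80)
candidate_fdr = false_discovery_rate(false_accepts=10, true_accepts=90)
improvement = relative_reduction(baseline_fdr, candidate_fdr)
print(f"baseline={baseline_fdr:.2f} candidate={candidate_fdr:.2f} relative={improvement:.0%}")
```

A relative figure such as 14.3% therefore describes the drop as a share of the baseline FDR, not a drop of 14.3 percentage points.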

Abstract

We propose a novel approach for semi-supervised learning (SSL) designed to overcome distribution shifts between training and real-world data arising in the keyword spotting (KWS) task. Shifts away from the training data distribution are a key challenge for real-world KWS tasks: when a new model is deployed on device, the gating of the accepted data undergoes a shift in distribution, making timely updates via subsequent deployments hard. Despite the shift, we assume that the marginal distributions on labels do not change. We utilize a modified teacher/student training framework, where labeled training data is augmented with unlabeled data. Note that the teacher does not have access to the new distribution either. To train effectively with a mix of human- and teacher-labeled data, we develop a teacher labeling strategy based on confidence heuristics to reduce entropy on the label…
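
The abstract is cut off before it describes the labeling heuristic, so the authors' exact procedure is not shown here. The sketch below only illustrates the two general ideas the abstract names: keeping teacher labels only where the teacher is confident, and then matching the label marginal, which the abstract assumes does not change under shift. The thresholds `low` and `high`, the function names, the subsampling scheme, and the example scores are all assumptions made for illustration.

```python
import random


def confident_pseudo_labels(scores, low=0.1, high=0.9):
    """Return (index, label) pairs for examples the teacher scores confidently.

    scores: teacher posteriors for the wakeword class, one per unlabeled example.
    Examples with low < score < high are treated as uncertain and dropped.
    """
    labeled = []
    for i, p in enumerate(scores):
        if p >= high:
            labeled.append((i, 1))
        elif p <= low:
            labeled.append((i, 0))
    return labeled


def match_label_marginal(labeled, positive_rate, seed=0):
    """Subsample pseudo-labels so positives make up roughly positive_rate.

    positive_rate stands in for the label marginal estimated from the
    human-labeled data (assumed unchanged under the shift).
    """
    if not 0.0 < positive_rate < 1.0:
        raise ValueError("positive_rate must be strictly between 0 and 1")
    rng = random.Random(seed)
    pos = [x for x in labeled if x[1] == 1]
    neg = [x for x in labeled if x[1] == 0]
    # Largest subset size that both classes can support at the target ratio.
    total = min(len(pos) / positive_rate, len(neg) / (1.0 - positive_rate))
    n_pos = int(total * positive_rate + 1e-9)
    n_neg = int(total * (1.0 - positive_rate) + 1e-9)
    return rng.sample(pos, n_pos) + rng.sample(neg, n_neg)


# Hypothetical teacher scores on ten unlabeled utterances.
teacher_scores = [0.95, 0.50, 0.05, 0.99, 0.02, 0.60, 0.01, 0.03, 0.92, 0.08]
confident = confident_pseudo_labels(teacher_scores)
selected = match_label_marginal(confident, positive_rate=0.5)
print(len(confident), len(selected), sum(label for _, label in selected))
```

Dropping uncertain examples lowers the entropy of the teacher-provided labels at the cost of discarding data; the subsampling step is one simple way to keep the pseudo-labeled set consistent with the assumed label marginal, and other schemes such as reweighting would serve the same purpose.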

Taxonomy

Topics: Speech and Audio Processing · Music and Audio Processing · Speech Recognition and Synthesis