Production federated keyword spotting via distillation, filtering, and joint federated-centralized training
Andrew Hard, Kurt Partridge, Neng Chen, Sean Augenstein, Aishanee, Shah, Hyun Jin Park, Alex Park, Sara Ng, Jessica Nguyen, Ignacio Lopez, Moreno, Rajiv Mathews, Fran\c{c}oise Beaufays

TL;DR
This paper presents a federated learning approach for keyword spotting that combines distillation, filtering, and joint training to improve model performance on user devices, addressing data domain gaps and unlabeled data.
Contribution
It introduces a novel federated training framework with confidence filtering and joint federated-centralized training for keyword spotting on mobile devices.
Findings
Significant improvements in offline quality metrics.
Enhanced user experience in live A/B tests.
Effective handling of unlabeled data through confidence filtering.
Abstract
We trained a keyword spotting model using federated learning on real user devices and observed significant improvements when the model was deployed for inference on phones. To compensate for data domains that are missing from on-device training caches, we employed joint federated-centralized training. And to learn in the absence of curated labels on-device, we formulated a confidence filtering strategy based on user-feedback signals for federated distillation. These techniques created models that significantly improved quality metrics in offline evaluations and user-experience metrics in live A/B experiments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPersonal Information Management and User Behavior · Digital Mental Health Interventions · Human Mobility and Location-Based Analysis
