Production federated keyword spotting via distillation, filtering, and joint federated-centralized training

Andrew Hard; Kurt Partridge; Neng Chen; Sean Augenstein; Aishanee Shah; Hyun Jin Park; Alex Park; Sara Ng; Jessica Nguyen; Ignacio Lopez Moreno; Rajiv Mathews; Françoise Beaufays

arXiv:2204.06322 · eess.AS · July 1, 2022

Open Access

TL;DR

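This paper presents a federated learning approach for keyword spotting that combines federated distillation, confidence filtering, and joint federated-centralized training. The approach compensates for data domains missing from on-device training caches and for the absence of curated labels on-device, and it improves model quality when deployed for inference on phones.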

Contribution

The paper introduces a federated training framework for keyword spotting on phones that pairs confidence filtering, based on user-feedback signals, with joint federated-centralized training.

Findings

01. Significant improvements in offline quality metrics.

02. Enhanced user experience in live A/B tests.

03. Effective handling of unlabeled data through confidence filtering.

Abstract

We trained a keyword spotting model using federated learning on real user devices and observed significant improvements when the model was deployed for inference on phones. To compensate for data domains that are missing from on-device training caches, we employed joint federated-centralized training. And to learn in the absence of curated labels on-device, we formulated a confidence filtering strategy based on user-feedback signals for federated distillation. These techniques created models that significantly improved quality metrics in offline evaluations and user-experience metrics in live A/B experiments.
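
The abstract does not give the filtering criteria or the mixing rule, so the following is only a minimal sketch of the two ideas under stated assumptions, not the authors' method. It assumes a teacher model that emits a keyword probability per cached utterance, an optional boolean user-feedback signal, fixed confidence thresholds, and a simple weighted blend of a federated model delta with a delta computed on centralized data. All names (`CachedExample`, `keep_for_distillation`, `joint_update`, the thresholds, and `central_weight`) are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class CachedExample:
    """One on-device utterance; every field here is a hypothetical stand-in."""
    teacher_score: float            # teacher's keyword probability in [0, 1]
    user_confirmed: Optional[bool]  # assumed feedback signal; None if unavailable


def keep_for_distillation(ex: CachedExample, low: float = 0.1, high: float = 0.9) -> bool:
    """Keep an example only if the teacher is confident and feedback does not contradict it."""
    if low < ex.teacher_score < high:
        return False  # teacher is unsure, so its soft label is unreliable
    teacher_says_keyword = ex.teacher_score >= high
    if ex.user_confirmed is not None and ex.user_confirmed != teacher_says_keyword:
        return False  # feedback disagrees with the teacher
    return True


def federated_average(client_deltas: List[List[float]]) -> List[float]:
    """Unweighted mean of per-client model deltas (a simplification of FedAvg)."""
    n = len(client_deltas)
    return [sum(vals) / n for vals in zip(*client_deltas)]


def joint_update(fed_delta: List[float], central_delta: List[float],
                 central_weight: float = 0.5) -> List[float]:
    """Blend a federated delta with a delta computed on centralized data (assumed rule)."""
    return [(1 - central_weight) * f + central_weight * c
            for f, c in zip(fed_delta, central_delta)]
```

In this sketch, filtering runs on each device before the student is trained against the teacher's scores, and the blend runs on the server once per round. The thresholds and the blend weight are placeholders that would need tuning.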

Taxonomy

Topics: Personal Information Management and User Behavior · Digital Mental Health Interventions · Human Mobility and Location-Based Analysis