Avoid Overfitting User Specific Information in Federated Keyword Spotting
Xin-Chun Li, Jin-Lin Tang, Shaoming Song, Bingshuai Li, Yinchuan Li,, Yunfeng Shao, Le Gan, De-Chuan Zhan

TL;DR
This paper introduces FedKWS-UI, a federated keyword spotting method that employs adversarial learning and adaptive training to prevent overfitting to user-specific data, enhancing privacy and robustness.
Contribution
It proposes novel strategies, including adversarial learning and adaptive local training, to reduce overfitting to user-specific information in federated keyword spotting models.
Findings
FedKWS-UI effectively captures user-invariant features.
Experimental results show improved generalization across users.
The approach enhances privacy by preventing overfitting to individual data.
Abstract
Keyword spotting (KWS) aims to discriminate a specific wake-up word from other signals precisely and efficiently for different users. Recent works utilize various deep networks to train KWS models with all users' speech data centralized without considering data privacy. Federated KWS (FedKWS) could serve as a solution without directly sharing users' data. However, the small amount of data, different user habits, and various accents could lead to fatal problems, e.g., overfitting or weight divergence. Hence, we propose several strategies to encourage the model not to overfit user-specific information in FedKWS. Specifically, we first propose an adversarial learning strategy, which updates the downloaded global model against an overfitted local model and explicitly encourages the global model to capture user-invariant information. Furthermore, we propose an adaptive local training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Music and Audio Processing · Topic Modeling
