Federated Learning for Mobile Keyboard Prediction
Andrew Hard, Kanishka Rao, Rajiv Mathews, Swaroop Ramaswamy,, Fran\c{c}oise Beaufays, Sean Augenstein, Hubert Eichner, Chlo\'e Kiddon,, Daniel Ramage

TL;DR
This paper demonstrates that federated learning can effectively train language models on mobile devices for keyboard prediction, improving privacy and prediction recall without exporting sensitive data.
Contribution
It introduces a federated learning framework for on-device language model training, showing improved prediction recall and enhanced user privacy compared to server-based methods.
Findings
Federated learning achieves better prediction recall than server-based training.
Training on-device enhances user privacy by not exporting sensitive data.
The approach is feasible for real-world mobile keyboard applications.
Abstract
We train a recurrent neural network language model using a distributed, on-device learning framework called federated learning for the purpose of next-word prediction in a virtual keyboard for smartphones. Server-based training using stochastic gradient descent is compared with training on client devices using the Federated Averaging algorithm. The federated algorithm, which enables training on a higher-quality dataset for this use case, is shown to achieve better prediction recall. This work demonstrates the feasibility and benefit of training language models on client devices without exporting sensitive user data to servers. The federated learning environment gives users greater control over the use of their data and simplifies the task of incorporating privacy by default with distributed training and aggregation across a population of client devices.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · User Authentication and Security Systems · Internet Traffic Analysis and Secure E-voting
