Communication-Efficient Learning of Deep Networks from Decentralized   Data

H. Brendan McMahan; Eider Moore; Daniel Ramage; Seth Hampson; Blaise; Ag\"uera y Arcas

arXiv:1602.05629·cs.LG·January 30, 2023·5.2k cites

Communication-Efficient Learning of Deep Networks from Decentralized Data

H. Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, Blaise, Ag\"uera y Arcas

PDF

Open Access 5 Repos

TL;DR

This paper introduces federated learning, a decentralized approach for training deep neural networks on mobile devices with privacy-sensitive data, achieving significant reductions in communication costs through iterative model averaging.

Contribution

It presents a practical federated learning method for deep networks, demonstrating robustness to non-IID data and substantial communication efficiency improvements.

Findings

01

Achieves 10-100x reduction in communication rounds

02

Demonstrates robustness to unbalanced, non-IID data

03

Validates approach across multiple models and datasets

Abstract

Modern mobile devices have access to a wealth of data suitable for learning models, which in turn can greatly improve the user experience on the device. For example, language models can improve speech recognition and text entry, and image models can automatically select good photos. However, this rich data is often privacy sensitive, large in quantity, or both, which may preclude logging to the data center and training there using conventional approaches. We advocate an alternative that leaves the training data distributed on the mobile devices, and learns a shared model by aggregating locally-computed updates. We term this decentralized approach Federated Learning. We present a practical method for the federated learning of deep networks based on iterative model averaging, and conduct an extensive empirical evaluation, considering five different model architectures and four datasets.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Human Mobility and Location-Based Analysis · Stochastic Gradient Optimization Techniques