Federated Optimization: Distributed Machine Learning for On-Device Intelligence
Jakub Kone\v{c}n\'y, H. Brendan McMahan, Daniel Ramage, Peter, Richt\'arik

TL;DR
This paper introduces Federated Optimization, a new distributed learning setting where data is unevenly spread across many devices, emphasizing communication efficiency and proposing a novel algorithm suitable for sparse convex problems.
Contribution
It defines the federated optimization setting, highlights the limitations of existing algorithms, and proposes a new algorithm tailored for this scenario with promising experimental results.
Findings
Existing algorithms are unsuitable for federated optimization.
The proposed algorithm performs well on sparse convex problems.
Federated optimization sets a foundation for future research in distributed on-device learning.
Abstract
We introduce a new and increasingly relevant setting for distributed optimization in machine learning, where the data defining the optimization are unevenly distributed over an extremely large number of nodes. The goal is to train a high-quality centralized model. We refer to this setting as Federated Optimization. In this setting, communication efficiency is of the utmost importance and minimizing the number of rounds of communication is the principal goal. A motivating example arises when we keep the training data locally on users' mobile devices instead of logging it to a data center for training. In federated optimziation, the devices are used as compute nodes performing computation on their local data in order to update a global model. We suppose that we have extremely large number of devices in the network --- as many as the number of users of a given service, each of which has…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Cryptography and Data Security
