TL;DR
This paper introduces a decentralized deep learning protocol that significantly reduces communication costs while maintaining high predictive accuracy, adaptable to concept drifts and suitable for mobile and autonomous applications.
Contribution
It presents a dynamic model averaging protocol that reduces communication by an order of magnitude without sacrificing performance, with theoretical bounds and empirical validation.
Findings
Reduces communication by an order of magnitude compared to existing methods.
Maintains predictive performance and loss bounds similar to periodic averaging schemes.
Validates effectiveness through extensive empirical evaluation.
Abstract
We propose an efficient protocol for decentralized training of deep neural networks from distributed data sources. The proposed protocol allows to handle different phases of model training equally well and to quickly adapt to concept drifts. This leads to a reduction of communication by an order of magnitude compared to periodically communicating state-of-the-art approaches. Moreover, we derive a communication bound that scales well with the hardness of the serialized learning problem. The reduction in communication comes at almost no cost, as the predictive performance remains virtually unchanged. Indeed, the proposed protocol retains loss bounds of periodically averaging schemes. An extensive empirical evaluation validates major improvement of the trade-off between model performance and communication which could be beneficial for numerous decentralized learning applications, such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
