Efficient Decentralized Deep Learning by Dynamic Model Averaging

Michael Kamp; Linara Adilova; Joachim Sicking; Fabian H\"uger; and Peter Schlicht; Tim Wirtz; Stefan Wrobel

arXiv:1807.03210·cs.LG·November 14, 2018

Efficient Decentralized Deep Learning by Dynamic Model Averaging

Michael Kamp, Linara Adilova, Joachim Sicking, Fabian H\"uger, and Peter Schlicht, Tim Wirtz, Stefan Wrobel

PDF

1 Repo

TL;DR

This paper introduces a decentralized deep learning protocol that significantly reduces communication costs while maintaining high predictive accuracy, adaptable to concept drifts and suitable for mobile and autonomous applications.

Contribution

It presents a dynamic model averaging protocol that reduces communication by an order of magnitude without sacrificing performance, with theoretical bounds and empirical validation.

Findings

01

Reduces communication by an order of magnitude compared to existing methods.

02

Maintains predictive performance and loss bounds similar to periodic averaging schemes.

03

Validates effectiveness through extensive empirical evaluation.

Abstract

We propose an efficient protocol for decentralized training of deep neural networks from distributed data sources. The proposed protocol allows to handle different phases of model training equally well and to quickly adapt to concept drifts. This leads to a reduction of communication by an order of magnitude compared to periodically communicating state-of-the-art approaches. Moreover, we derive a communication bound that scales well with the hardness of the serialized learning problem. The reduction in communication comes at almost no cost, as the predictive performance remains virtually unchanged. Indeed, the proposed protocol retains loss bounds of periodically averaging schemes. An extensive empirical evaluation validates major improvement of the trade-off between model performance and communication which could be beneficial for numerous decentralized learning applications, such as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fraunhofer-iais/dlplatform/tree/master/DLplatform
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.