Collaborative Deep Learning in Fixed Topology Networks

Zhanhong Jiang; Aditya Balu; Chinmay Hegde; Soumik Sarkar

arXiv:1706.07880·stat.ML·June 27, 2017·77 cites

Collaborative Deep Learning in Fixed Topology Networks

Zhanhong Jiang, Aditya Balu, Chinmay Hegde, Soumik Sarkar

PDF

Open Access

TL;DR

This paper introduces a consensus-based distributed SGD algorithm for collaborative deep learning over fixed topology networks, enabling data parallelization and decentralized computation in communication-constrained environments.

Contribution

It presents a novel consensus-based distributed SGD method with convergence analysis for fixed topology networks, addressing data privacy and communication constraints.

Findings

01

The proposed algorithms outperform centralized SGD in certain settings.

02

Demonstrated effectiveness on benchmark datasets like MNIST, CIFAR-10, and CIFAR-100.

03

Convergence properties established for both convex and nonconvex objectives.

Abstract

There is significant recent interest to parallelize deep learning algorithms in order to handle the enormous growth in data and model sizes. While most advances focus on model parallelization and engaging multiple computing agents via using a central parameter server, aspect of data parallelization along with decentralized computation has not been explored sufficiently. In this context, this paper presents a new consensus-based distributed SGD (CDSGD) (and its momentum variant, CDMSGD) algorithm for collaborative deep learning over fixed topology networks that enables data parallelization as well as decentralized computation. Such a framework can be extremely useful for learning agents with access to only local/private data in a communication constrained environment. We analyze the convergence properties of the proposed algorithm with strongly convex and nonconvex objective functions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems · Stochastic Gradient Optimization Techniques · Advanced Memory and Neural Computing

MethodsStochastic Gradient Descent