Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous Distributed Learning

Eugene Belilovsky (MILA), Louis Leconte (MLIA, CMAP), Lucas Caccia (MILA), Michael Eickenberg, Edouard Oyallon (MLIA)

arXiv:2106.06401 · cs.LG · June 14, 2021 · 1 citation

Open Access

TL;DR

This paper introduces Decoupled Greedy Learning (DGL), a method that enables parallel and asynchronous training of CNN layers, reducing communication overhead and addressing update locking in distributed neural network training.

Contribution

The paper proposes a decoupled greedy training approach for CNNs that allows potentially linear parallelization across layers or modules and supports asynchronous updates, with bandwidth reduction via online vector quantization.

Findings

1. DGL achieves effective training on CIFAR-10 and ImageNet datasets.
2. The approach converges both theoretically and empirically.
3. DGL outperforms some existing methods in distributed CNN training.

Abstract

A commonly cited inefficiency of neural network training using back-propagation is the update locking problem: each layer must wait for the signal to propagate through the full network before updating. Several alternatives that can alleviate this issue have been proposed. In this context, we consider a simple alternative based on minimal feedback, which we call Decoupled Greedy Learning (DGL). It is based on a classic greedy relaxation of the joint training objective, recently shown to be effective in the context of Convolutional Neural Networks (CNNs) on large-scale image classification. We consider an optimization of this objective that permits us to decouple the layer training, allowing for layers or modules in networks to be trained with a potentially linear parallelization. With the use of a replay buffer we show that this approach can be extended to asynchronous settings, where…
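
The decoupling described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the `GreedyModule` class, the `dgl_step` helper, the dense ReLU layers (standing in for CNN blocks), the auxiliary linear classifiers, the toy data, and all sizes and learning rates are assumptions chosen for illustration. Each module updates from its own local loss as soon as its input arrives, and no gradient is passed back to earlier modules, so no module waits on a backward pass through the full network.

```python
import numpy as np

def softmax_xent(logits, y):
    """Mean cross-entropy loss and its gradient with respect to the logits."""
    z = logits - logits.max(axis=1, keepdims=True)  # stabilise the exponent
    p = np.exp(z)
    p /= p.sum(axis=1, keepdims=True)
    n = len(y)
    loss = -np.log(p[np.arange(n), y]).mean()
    p[np.arange(n), y] -= 1.0
    return loss, p / n

class GreedyModule:
    """Hypothetical module: one dense ReLU layer plus its own auxiliary classifier."""

    def __init__(self, d_in, d_out, n_classes, rng, lr=0.1):
        self.W = rng.normal(0.0, d_in ** -0.5, (d_in, d_out))       # layer weights
        self.A = rng.normal(0.0, d_out ** -0.5, (d_out, n_classes))  # auxiliary head
        self.lr = lr

    def local_step(self, x, y):
        """Update W and A from the local auxiliary loss; return (loss, output)."""
        h = np.maximum(x @ self.W, 0.0)
        loss, g_logits = softmax_xent(h @ self.A, y)
        g_A = h.T @ g_logits
        g_h = (g_logits @ self.A.T) * (h > 0)
        g_W = x.T @ g_h
        self.A -= self.lr * g_A
        self.W -= self.lr * g_W
        # No gradient with respect to x is computed: nothing flows to earlier modules.
        return loss, h

def dgl_step(modules, x, y):
    """One pass: each module trains on its input, then forwards its output."""
    losses = []
    for m in modules:
        loss, x = m.local_step(x, y)
        losses.append(loss)
    return losses

# Toy usage on synthetic two-class data (assumed, not from the paper).
rng = np.random.default_rng(0)
X = rng.normal(size=(64, 8))
y = (X[:, 0] + X[:, 1] > 0).astype(int)
modules = [GreedyModule(8, 16, 2, rng), GreedyModule(16, 16, 2, rng)]
for step in range(50):
    losses = dgl_step(modules, X, y)
print("local losses per module:", [round(float(l), 3) for l in losses])
```

In this sketch the modules run one after another in a single process. Because a module's update depends only on its own input and the labels, the same loop body could in principle run on separate workers, with each worker consuming the previous module's outputs as they become available.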

Taxonomy

Topics: Neural Networks and Applications · Brain Tumor Detection and Classification · Machine Learning and ELM