Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous Distributed Learning
Eugene Belilovsky (MILA), Louis Leconte (MLIA, CMAP), Lucas Caccia, (MILA), Michael Eickenberg, Edouard Oyallon (MLIA)

TL;DR
This paper introduces Decoupled Greedy Learning (DGL), a method that enables parallel and asynchronous training of CNN layers, reducing communication overhead and addressing update locking in distributed neural network training.
Contribution
It proposes a novel decoupled greedy training approach for CNNs that allows for linear parallelization and asynchronous updates, with bandwidth reduction via online vector quantization.
Findings
DGL achieves effective training on CIFAR-10 and ImageNet datasets.
The approach converges both theoretically and empirically.
DGL outperforms some existing methods in distributed CNN training.
Abstract
A commonly cited inefficiency of neural network training using back-propagation is the update locking problem: each layer must wait for the signal to propagate through the full network before updating. Several alternatives that can alleviate this issue have been proposed. In this context, we consider a simple alternative based on minimal feedback, which we call Decoupled Greedy Learning (DGL). It is based on a classic greedy relaxation of the joint training objective, recently shown to be effective in the context of Convolutional Neural Networks (CNNs) on large-scale image classification. We consider an optimization of this objective that permits us to decouple the layer training, allowing for layers or modules in networks to be trained with a potentially linear parallelization. With the use of a replay buffer we show that this approach can be extended to asynchronous settings, where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Brain Tumor Detection and Classification · Machine Learning and ELM
