Block-local learning with probabilistic latent representations
David Kappel, Khaleelulla Khan Nazeer, Cabrel Teguemne Fokam,, Christian Mayr, Anand Subramoney

TL;DR
This paper introduces a block-local learning method that enables parallel training of neural networks by dividing them into blocks with local losses, addressing the locking and weight transport problems of backpropagation.
Contribution
The paper proposes a novel block-local learning framework that allows parallel training and reduces dependency on weight transport, with a statistical interpretation of local errors.
Findings
Achieves state-of-the-art performance on various tasks.
Addresses locking and weight transport problems in neural training.
Enables scalable and distributed training of large models.
Abstract
The ubiquitous backpropagation algorithm requires sequential updates through the network introducing a locking problem. In addition, back-propagation relies on the transpose of forward weight matrices to compute updates, introducing a weight transport problem across the network. Locking and weight transport are problems because they prevent efficient parallelization and horizontal scaling of the training process. We propose a new method to address both these problems and scale up the training of large models. Our method works by dividing a deep neural network into blocks and introduces a feedback network that propagates the information from the targets backwards to provide auxiliary local losses. Forward and backward propagation can operate in parallel and with different sets of weights, addressing the problems of locking and weight transport. Our approach derives from a statistical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Human Mobility and Location-Based Analysis · Traffic Prediction and Management Techniques
