Large-Scale Gradient-Free Deep Learning with Recursive Local Representation Alignment
Alexander Ororbia, Ankur Mali, Daniel Kifer, C. Lee Giles

TL;DR
This paper introduces a gradient-free, biologically plausible training method for deep neural networks, demonstrating comparable performance to backpropagation on large datasets like ImageNet with faster convergence.
Contribution
The paper presents recursive local representation alignment, a novel gradient-free training algorithm that scales to large datasets and neural architectures, offering an alternative to backpropagation.
Findings
Achieves similar accuracy to backprop on ImageNet
Converges faster due to parallelizable weight updates
Requires less computational resources
Abstract
Training deep neural networks on large-scale datasets requires significant hardware resources whose costs (even on cloud platforms) put them out of reach of smaller organizations, groups, and individuals. Backpropagation, the workhorse for training these networks, is an inherently sequential process that is difficult to parallelize. Furthermore, it requires researchers to continually develop various tricks, such as specialized weight initializations and activation functions, in order to ensure a stable parameter optimization. Our goal is to seek an effective, neuro-biologically-plausible alternative to backprop that can be used to train deep networks. In this paper, we propose a gradient-free learning procedure, recursive local representation alignment, for training large-scale neural architectures. Experiments with residual networks on CIFAR-10 and the large benchmark, ImageNet, show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
