Large-Scale Gradient-Free Deep Learning with Recursive Local   Representation Alignment

Alexander Ororbia; Ankur Mali; Daniel Kifer; C. Lee Giles

arXiv:2002.03911·cs.LG·September 22, 2020·5 cites

Large-Scale Gradient-Free Deep Learning with Recursive Local Representation Alignment

Alexander Ororbia, Ankur Mali, Daniel Kifer, C. Lee Giles

PDF

Open Access 1 Repo

TL;DR

This paper introduces a gradient-free, biologically plausible training method for deep neural networks, demonstrating comparable performance to backpropagation on large datasets like ImageNet with faster convergence.

Contribution

The paper presents recursive local representation alignment, a novel gradient-free training algorithm that scales to large datasets and neural architectures, offering an alternative to backpropagation.

Findings

01

Achieves similar accuracy to backprop on ImageNet

02

Converges faster due to parallelizable weight updates

03

Requires less computational resources

Abstract

Training deep neural networks on large-scale datasets requires significant hardware resources whose costs (even on cloud platforms) put them out of reach of smaller organizations, groups, and individuals. Backpropagation, the workhorse for training these networks, is an inherently sequential process that is difficult to parallelize. Furthermore, it requires researchers to continually develop various tricks, such as specialized weight initializations and activation functions, in order to ensure a stable parameter optimization. Our goal is to seek an effective, neuro-biologically-plausible alternative to backprop that can be used to train deep networks. In this paper, we propose a gradient-free learning procedure, recursive local representation alignment, for training large-scale neural architectures. Experiments with residual networks on CIFAR-10 and the large benchmark, ImageNet, show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liukidar/pcax
jax

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning