Scaling Equilibrium Propagation to Deep ConvNets by Drastically Reducing   its Gradient Estimator Bias

Axel Laborieux; Maxence Ernoult; Benjamin Scellier; Yoshua Bengio,; Julie Grollier; Damien Querlioz

arXiv:2006.03824·cs.NE·June 9, 2020

Scaling Equilibrium Propagation to Deep ConvNets by Drastically Reducing its Gradient Estimator Bias

Axel Laborieux, Maxence Ernoult, Benjamin Scellier, Yoshua Bengio,, Julie Grollier, Damien Querlioz

PDF

1 Repo

TL;DR

This paper improves Equilibrium Propagation (EP) by reducing gradient bias through symmetric nudging, enabling training of deep ConvNets and achieving competitive results on CIFAR-10, thus enhancing EP's scalability and biological plausibility.

Contribution

It introduces a bias reduction technique using symmetric nudging and generalizes EP equations to cross-entropy loss, allowing deep network training with EP.

Findings

01

Achieved 11.7% test error on CIFAR-10 with EP.

02

Reduced gradient bias enables training deep ConvNets with EP.

03

Demonstrated EP's potential as a biologically plausible learning algorithm.

Abstract

Equilibrium Propagation (EP) is a biologically-inspired algorithm for convergent RNNs with a local learning rule that comes with strong theoretical guarantees. The parameter updates of the neural network during the credit assignment phase have been shown mathematically to approach the gradients provided by Backpropagation Through Time (BPTT) when the network is infinitesimally nudged toward its target. In practice, however, training a network with the gradient estimates provided by EP does not scale to visual tasks harder than MNIST. In this work, we show that a bias in the gradient estimate of EP, inherent in the use of finite nudging, is responsible for this phenomenon and that cancelling it allows training deep ConvNets by EP. We show that this bias can be greatly reduced by using symmetric nudging (a positive nudging and a negative one). We also generalize previous EP equations to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Laborieux-Axel/Equilibrium-Propagation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.