TL;DR
This paper improves Equilibrium Propagation (EP) by reducing gradient bias through symmetric nudging, enabling training of deep ConvNets and achieving competitive results on CIFAR-10, thus enhancing EP's scalability and biological plausibility.
Contribution
It introduces a bias reduction technique using symmetric nudging and generalizes EP equations to cross-entropy loss, allowing deep network training with EP.
Findings
Achieved 11.7% test error on CIFAR-10 with EP.
Reduced gradient bias enables training deep ConvNets with EP.
Demonstrated EP's potential as a biologically plausible learning algorithm.
Abstract
Equilibrium Propagation (EP) is a biologically-inspired algorithm for convergent RNNs with a local learning rule that comes with strong theoretical guarantees. The parameter updates of the neural network during the credit assignment phase have been shown mathematically to approach the gradients provided by Backpropagation Through Time (BPTT) when the network is infinitesimally nudged toward its target. In practice, however, training a network with the gradient estimates provided by EP does not scale to visual tasks harder than MNIST. In this work, we show that a bias in the gradient estimate of EP, inherent in the use of finite nudging, is responsible for this phenomenon and that cancelling it allows training deep ConvNets by EP. We show that this bias can be greatly reduced by using symmetric nudging (a positive nudging and a negative one). We also generalize previous EP equations to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
