Gradient-Free Training of Recurrent Neural Networks using Random Perturbations
Jesus Garcia Fernandez, Sander Keemink, Marcel van Gerven

TL;DR
This paper introduces a time-domain activity-based node perturbation method for training recurrent neural networks, achieving performance comparable to backpropagation through time while avoiding its computational drawbacks.
Contribution
The study extends activity-based node perturbation to the temporal domain, providing an efficient alternative to BPTT for training RNNs with similar effectiveness.
Findings
Performance comparable to BPTT in convergence and scalability
Outperforms standard perturbation methods
Suitable for neuromorphic computing applications
Abstract
Recurrent neural networks (RNNs) hold immense potential for computations due to their Turing completeness and sequential processing capabilities, yet existing methods for their training encounter efficiency challenges. Backpropagation through time (BPTT), the prevailing method, extends the backpropagation (BP) algorithm by unrolling the RNN over time. However, this approach suffers from significant drawbacks, including the need to interleave forward and backward phases and store exact gradient information. Furthermore, BPTT has been shown to struggle to propagate gradient information for long sequences, leading to vanishing gradients. An alternative strategy to using gradient-based methods like BPTT involves stochastically approximating gradients through perturbation-based methods. This learning approach is exceptionally simple, necessitating only forward passes in the network and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
