Gradient-Free Training of Recurrent Neural Networks using Random   Perturbations

Jesus Garcia Fernandez; Sander Keemink; Marcel van Gerven

arXiv:2405.08967·cs.LG·October 2, 2024

Gradient-Free Training of Recurrent Neural Networks using Random Perturbations

Jesus Garcia Fernandez, Sander Keemink, Marcel van Gerven

PDF

Open Access 1 Repo

TL;DR

This paper introduces a time-domain activity-based node perturbation method for training recurrent neural networks, achieving performance comparable to backpropagation through time while avoiding its computational drawbacks.

Contribution

The study extends activity-based node perturbation to the temporal domain, providing an efficient alternative to BPTT for training RNNs with similar effectiveness.

Findings

01

Performance comparable to BPTT in convergence and scalability

02

Outperforms standard perturbation methods

03

Suitable for neuromorphic computing applications

Abstract

Recurrent neural networks (RNNs) hold immense potential for computations due to their Turing completeness and sequential processing capabilities, yet existing methods for their training encounter efficiency challenges. Backpropagation through time (BPTT), the prevailing method, extends the backpropagation (BP) algorithm by unrolling the RNN over time. However, this approach suffers from significant drawbacks, including the need to interleave forward and backward phases and store exact gradient information. Furthermore, BPTT has been shown to struggle to propagate gradient information for long sequences, leading to vanishing gradients. An alternative strategy to using gradient-based methods like BPTT involves stochastically approximating gradients through perturbation-based methods. This learning approach is exceptionally simple, necessitating only forward passes in the network and a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jesusgf96/np_rnns
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications