Convergence Analysis of Real-time Recurrent Learning (RTRL) for a class   of Recurrent Neural Networks

Samuel Chun-Hei Lam; Justin Sirignano; Konstantinos Spiliopoulos

arXiv:2501.08040·cs.LG·January 15, 2025

Convergence Analysis of Real-time Recurrent Learning (RTRL) for a class of Recurrent Neural Networks

Samuel Chun-Hei Lam, Justin Sirignano, Konstantinos Spiliopoulos

PDF

Open Access

TL;DR

This paper proves the convergence of the RTRL algorithm for a class of RNNs, demonstrating its potential for exact online training on long sequences despite computational costs.

Contribution

It provides the first convergence proof for RTRL in a specific class of RNNs, establishing theoretical foundations for its use in long sequence data analysis.

Findings

01

RTRL converges to a stationary point of the loss function.

02

Theoretical analysis of RTRL's fixed point for data, hidden states, and derivatives.

03

Numerical studies support the convergence results.

Abstract

Recurrent neural networks (RNNs) are commonly trained with the truncated backpropagation-through-time (TBPTT) algorithm. For the purposes of computational tractability, the TBPTT algorithm truncates the chain rule and calculates the gradient on a finite block of the overall data sequence. Such approximation could lead to significant inaccuracies, as the block length for the truncated backpropagation is typically limited to be much smaller than the overall sequence length. In contrast, Real-time recurrent learning (RTRL) is an online optimization algorithm which asymptotically follows the true gradient of the loss on the data sequence as the number of sequence time steps $t \to \infty$ . RTRL forward propagates the derivatives of the RNN hidden/memory units with respect to the parameters and, using the forward derivatives, performs online updates of the parameters at each time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications