Convergence Analysis of Real-time Recurrent Learning (RTRL) for a class of Recurrent Neural Networks
Samuel Chun-Hei Lam, Justin Sirignano, Konstantinos Spiliopoulos

TL;DR
This paper proves the convergence of the RTRL algorithm for a class of RNNs, demonstrating its potential for exact online training on long sequences despite computational costs.
Contribution
It provides the first convergence proof for RTRL in a specific class of RNNs, establishing theoretical foundations for its use in long sequence data analysis.
Findings
RTRL converges to a stationary point of the loss function.
Theoretical analysis of RTRL's fixed point for data, hidden states, and derivatives.
Numerical studies support the convergence results.
Abstract
Recurrent neural networks (RNNs) are commonly trained with the truncated backpropagation-through-time (TBPTT) algorithm. For the purposes of computational tractability, the TBPTT algorithm truncates the chain rule and calculates the gradient on a finite block of the overall data sequence. Such approximation could lead to significant inaccuracies, as the block length for the truncated backpropagation is typically limited to be much smaller than the overall sequence length. In contrast, Real-time recurrent learning (RTRL) is an online optimization algorithm which asymptotically follows the true gradient of the loss on the data sequence as the number of sequence time steps . RTRL forward propagates the derivatives of the RNN hidden/memory units with respect to the parameters and, using the forward derivatives, performs online updates of the parameters at each time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
