Kernel Limit for a Class of Recurrent Neural Networks Trained on Ergodic Data Sequences
Samuel Chun-Hei Lam, Justin Sirignano, and Konstantinos Spiliopoulos

TL;DR
This paper develops mathematical methods to analyze the asymptotic behavior of recurrent neural networks (RNNs) with simplified weights, revealing their convergence to infinite-dimensional limits and establishing neural tangent kernel (NTK) limits for large-scale training.
Contribution
It introduces a fixed point analysis for RNN memory states and derives NTK limits for RNNs trained on ergodic data sequences, addressing challenges unique to RNNs.
Findings
RNNs converge to solutions of infinite-dimensional ODEs and fixed point equations.
Established NTK limits for RNNs with growing data and network size.
Provided convergence estimates for RNN memory state evolution.
Abstract
Mathematical methods are developed to characterize the asymptotics of recurrent neural networks (RNN) as the number of hidden units, data samples in the sequence, hidden state updates, and training steps simultaneously grow to infinity. In the case of an RNN with a simplified weight matrix, we prove the convergence of the RNN to the solution of an infinite-dimensional ODE coupled with the fixed point of a random algebraic equation. The analysis requires addressing several challenges which are unique to RNNs. In typical mean-field applications (e.g., feedforward neural networks), discrete updates are of magnitude and the number of updates is . Therefore, the system can be represented as an Euler approximation of an appropriate ODE/PDE, which it will converge to as . However, the RNN hidden layer updates are .…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Neural Networks and Applications
