Linear RNNs Provably Learn Linear Dynamic Systems

Lifu Wang; Tianyu Wang; Shengwei Yi; Bo Shen; Bo Hu; Xing Cao

arXiv:2211.10582 · cs.LG · October 24, 2023

TL;DR

This paper provides the first theoretical proof that linear RNNs can learn any stable linear dynamic system efficiently using gradient descent, highlighting the benefits of recurrent structure in learning dynamics.

Contribution

It establishes the first theoretical guarantee that linear RNNs can learn stable linear systems, with sample and time complexity polynomial in $\frac{1}{1 - \rho_C}$ and a required hidden width that does not depend on the input sequence length.

Findings

01

Linear RNNs trained with gradient descent can learn stable linear systems with sample and time complexity polynomial in $\frac{1}{1 - \rho_C}$.

02

The required hidden-layer width of the RNN does not depend on the input sequence length.

03

Recurrent structure aids in learning dynamic systems effectively.

Abstract

We study the learning ability of linear recurrent neural networks (RNNs) trained with gradient descent. We prove the first theoretical guarantee that linear RNNs can learn any stable linear dynamic system under a large class of loss functions. For an arbitrary stable linear system with a parameter $\rho_C$ related to the transition matrix $C$, we show that, despite the non-convexity of the optimization loss, if the width of the RNN is large enough (and the required width of the hidden layers does not depend on the length of the input sequence), a linear RNN can provably learn any stable linear dynamic system with sample and time complexity polynomial in $\frac{1}{1 - \rho_C}$. Our results provide the first theoretical guarantee for learning a linear RNN and demonstrate how the recurrent structure can help to learn a dynamic system.
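The setting described in the abstract can be illustrated with a small numerical sketch. This is not the paper's construction or proof setup. It is a minimal example under stated assumptions: the teacher system is written as p_t = C p_{t-1} + D x_t, y_t = G p_t, the student linear RNN as h_t = W h_{t-1} + A x_t, y_t = B h_t, the loss is a squared error, and the spectral norm of C is used as a stand-in for $\rho_C$. The names D, G, W, A, B, all sizes, the initialization, and the learning rate are hypothetical choices made for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes chosen only for illustration.
d_in, d_state, d_out = 2, 3, 1   # teacher (ground-truth system) dimensions
width, T, N = 16, 10, 64         # RNN hidden width, sequence length, samples

# Teacher: p_t = C p_{t-1} + D x_t, y_t = G p_t, with C rescaled to be stable.
C = rng.standard_normal((d_state, d_state))
C *= 0.5 / np.linalg.norm(C, 2)  # spectral norm 0.5 (assumed stand-in for rho_C)
D = rng.standard_normal((d_state, d_in)) / np.sqrt(d_in)
G = rng.standard_normal((d_out, d_state)) / np.sqrt(d_state)


def run_linear_system(W, A, B, X):
    """Run h_t = W h_{t-1} + A x_t, y_t = B h_t on a batch X of shape (N, T, d_in)."""
    h = np.zeros((X.shape[0], W.shape[0]))
    hs, ys = [h], []
    for t in range(X.shape[1]):
        h = h @ W.T + X[:, t] @ A.T
        hs.append(h)
        ys.append(h @ B.T)
    return hs, np.stack(ys, axis=1)


def loss_and_grads(W, A, B, X, Y):
    """Mean squared error over all time steps, with gradients via backprop through time."""
    n, steps_t = X.shape[0], X.shape[1]
    hs, Y_hat = run_linear_system(W, A, B, X)
    loss = 0.5 * np.sum((Y_hat - Y) ** 2) / (n * steps_t)
    err = (Y_hat - Y) / (n * steps_t)          # dLoss / dY_hat
    gW, gA, gB = np.zeros_like(W), np.zeros_like(A), np.zeros_like(B)
    g_h = np.zeros_like(hs[0])                 # gradient flowing into the next hidden state
    for t in reversed(range(steps_t)):
        h_t, h_prev = hs[t + 1], hs[t]
        gB += err[:, t].T @ h_t
        g_h = err[:, t] @ B + g_h @ W          # gradient w.r.t. h_t
        gW += g_h.T @ h_prev
        gA += g_h.T @ X[:, t]
    return loss, gW, gA, gB


# Training data generated by the teacher system.
X = rng.standard_normal((N, T, d_in))
_, Y = run_linear_system(C, D, G, X)

# Student linear RNN with a small random initialization (an assumption, not the paper's scheme).
W = 0.1 * rng.standard_normal((width, width)) / np.sqrt(width)
A = rng.standard_normal((width, d_in)) / np.sqrt(width)
B = rng.standard_normal((d_out, width)) / np.sqrt(width)

lr, steps = 0.02, 500
initial_loss = loss_and_grads(W, A, B, X, Y)[0]
for _ in range(steps):
    _, gW, gA, gB = loss_and_grads(W, A, B, X, Y)
    W, A, B = W - lr * gW, A - lr * gA, B - lr * gB
final_loss = loss_and_grads(W, A, B, X, Y)[0]

print(f"loss before training: {initial_loss:.4f}")
print(f"loss after training:  {final_loss:.4f}")
```

The student is deliberately wider than the teacher (16 hidden units against 3 states), which loosely mirrors the "large enough width" condition in the abstract. The sketch only shows the training loss decreasing on one random instance. It does not verify the polynomial dependence on $\frac{1}{1 - \rho_C}$ claimed in the paper.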

Taxonomy

Topics: Neural Networks and Applications · Model Reduction and Neural Networks · Machine Learning and Algorithms