When Learning Hurts: Fixed-Pole RNN for Real-Time Online Training
Alexander Morgan, Ummay Sumaya Khan, Lingjia Liu, Lizhong Zheng

TL;DR
This paper demonstrates that fixing recurrent poles in RNNs leads to more stable, efficient, and effective online learning in data-constrained scenarios, compared to learning the poles.
Contribution
The work provides both analytical and empirical evidence that fixing recurrent poles in RNNs improves stability and reduces training complexity in real-time online learning.
Findings
Fixed-pole RNNs outperform learned-pole RNNs in data-limited settings.
Learning poles increases non-convexity and training difficulty.
Fixed-pole architectures yield better performance with less training data.
Abstract
Recurrent neural networks (RNNs) can be interpreted as discrete-time state-space models, where the state evolution corresponds to an infinite-impulse-response (IIR) filtering operation governed by both feedforward weights and recurrent poles. While, in principle, all parameters including pole locations can be optimized via backpropagation through time (BPTT), such joint learning incurs substantial computational overhead and is often impractical for applications with limited training data. Echo state networks (ESNs) mitigate this limitation by fixing the recurrent dynamics and training only a linear readout, enabling efficient and stable online adaptation. In this work, we analytically and empirically examine why learning recurrent poles does not provide tangible benefits in data-constrained, real-time learning scenarios. Our analysis shows that pole learning renders the weight…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Model Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis
