Memory Capacity of Nonlinear Recurrent Networks: Is it Informative?

Giovanni Ballarin; Lyudmila Grigoryeva; Juan-Pablo Ortega

arXiv:2502.04832·cs.LG·September 3, 2025

Memory Capacity of Nonlinear Recurrent Networks: Is it Informative?

Giovanni Ballarin, Lyudmila Grigoryeva, Juan-Pablo Ortega

PDF

Open Access 1 Repo

TL;DR

This paper investigates the memory capacity of nonlinear recurrent neural networks, revealing that it varies arbitrarily and questions the usefulness of existing metrics for assessing their performance in processing stochastic signals.

Contribution

It demonstrates that the memory capacity of nonlinear RNNs depends solely on input scale, challenging the relevance of traditional memory capacity metrics.

Findings

01

Memory capacity varies arbitrarily within bounds

02

Existing metrics lack practical value for nonlinear RNNs

03

Memory capacity depends only on input scale

Abstract

The total memory capacity (MC) of linear recurrent neural networks (RNNs) has been proven to be equal to the rank of the corresponding Kalman controllability matrix, and it is almost surely maximal for connectivity and input weight matrices drawn from regular distributions. This fact questions the usefulness of this metric in distinguishing the performance of linear RNNs in the processing of stochastic signals. This work shows that the MC of random nonlinear RNNs yields arbitrary values within established upper and lower bounds depending exclusively on the scale of the input process. This confirms that the existing definition of MC in linear and nonlinear cases has no practical value.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

learning-of-dynamic-processes/memorycapacity
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Quantum Computing Algorithms and Architecture · Advanced Memory and Neural Computing