An Uncertainty Principle for Linear Recurrent Neural Networks

Alexandre Fran\c{c}ois; Antonio Orvieto; Francis Bach

arXiv:2502.09287·cs.LG·February 14, 2025

An Uncertainty Principle for Linear Recurrent Neural Networks

Alexandre Fran\c{c}ois, Antonio Orvieto, Francis Bach

PDF

Open Access

TL;DR

This paper establishes an uncertainty principle for linear recurrent neural networks, showing a fundamental trade-off between the range of past information they can effectively utilize and the accuracy of the approximation.

Contribution

It provides a theoretical characterization of the limitations and capabilities of linear RNNs on a core copy task, including lower bounds and explicit optimal filters.

Findings

01

Derived lower bounds for approximation error in linear RNNs.

02

Constructed explicit filters that achieve the theoretical bounds.

03

Discovered an uncertainty principle relating filter range and accuracy.

Abstract

We consider linear recurrent neural networks, which have become a key building block of sequence modeling due to their ability for stable and effective long-range modeling. In this paper, we aim at characterizing this ability on a simple but core copy task, whose goal is to build a linear filter of order $S$ that approximates the filter that looks $K$ time steps in the past (which we refer to as the shift- $K$ filter), where $K$ is larger than $S$ . Using classical signal models and quadratic cost, we fully characterize the problem by providing lower bounds of approximation, as well as explicit filters that achieve this lower bound up to constants. The optimal performance highlights an uncertainty principle: the optimal filter has to average values around the $K$ -th time step in the past with a range~(width) that is proportional to $K / S$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Fault Detection and Control Systems