Rethinking State Tracking in Recurrent Models Through Error Control Dynamics

Jiwan Chung; Heechan Choi; Seon Joo Kim

arXiv:2605.07755·cs.LG·May 11, 2026

Rethinking State Tracking in Recurrent Models Through Error Control Dynamics

Jiwan Chung, Heechan Choi, Seon Joo Kim

PDF

TL;DR

This paper challenges the focus on expressive capacity in recurrent models, emphasizing the importance of error control dynamics for robust state tracking, and demonstrates that affine recurrent networks have fundamental limitations in error correction.

Contribution

It introduces a theoretical framework highlighting error control as crucial for state tracking, revealing limitations of affine recurrent networks in error correction and robustness.

Findings

01

Affine recurrent networks cannot correct errors along state-separating subspaces.

02

Tracking failure occurs when within-class spread exceeds initial between-class separation.

03

Empirical results predict tracking collapse based on distinguishability ratio crossing a threshold.

Abstract

The theory of state tracking in recurrent architectures has predominantly focused on expressive capacity: whether a fixed architecture can theoretically realize a set of symbolic transition rules. We argue that equally important is error control, the dynamics governing hidden-state drift along the directions that distinguish symbolic states. We prove that affine recurrent networks, a class of models encompassing State-Space Models and Linear Attention, cannot correct errors along state-separating subspaces once they preserve state representations. Consequently, practical affine trackers do not learn robust state tracking; rather, they learn finite horizon solutions governed by accumulated state-relevant error. We characterize the mechanics of this failure, showing that tracking remains readable only while the accumulating within-class spread remains small relative to the initial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.