StateLinFormer: Stateful Training Enhancing Long-term Memory in Navigation

Zhiyuan Chen; Yuxuan Zhong; Fan Wang; Bo Yu; Pengtao Shao; Shaoshan Liu; Ning Ding

arXiv:2603.23571 · cs.LG · March 26, 2026

TL;DR

StateLinFormer is a linear-attention navigation model trained statefully, so its memory persists across interactions rather than being reset. This significantly improves long-horizon memory retention and adaptation.

Contribution

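The paper presents a stateful training paradigm for linear-attention models: recurrent memory states are carried across consecutive training segments instead of being reinitialized at each batch boundary, which gives the model persistent memory and improves long-term navigation.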

Findings

1. Outperforms stateless and fixed-window Transformers in navigation tasks.
2. Significantly improves long-horizon memory retention with increased interaction length.
3. Enhances in-context learning capabilities for navigation.

Abstract

Effective navigation intelligence relies on long-term memory to support both immediate generalization and sustained adaptation. However, existing approaches face a dilemma: modular systems rely on explicit mapping but lack flexibility, while Transformer-based end-to-end models are constrained by fixed context windows, limiting persistent memory across extended interactions. We introduce StateLinFormer, a linear-attention navigation model trained with a stateful memory mechanism that preserves recurrent memory states across consecutive training segments instead of reinitializing them at each batch boundary. This training paradigm effectively approximates learning on infinitely long sequences, enabling the model to achieve long-horizon memory retention. Experiments across both MAZE and ProcTHOR environments demonstrate that StateLinFormer significantly outperforms its stateless…
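The stateful mechanism described in the abstract can be illustrated with a small forward-only sketch. It is not the authors' implementation: the feature map `phi` (elu + 1), the normalizer `z`, and the helper names `init_state`, `run_segment`, and `run_segments` are all assumptions chosen for illustration. Gradient handling across segment boundaries is not covered, since the abstract does not specify it. The sketch shows only the core idea: a linear-attention layer has a fixed-size recurrent state, and carrying that state across segments makes segment-wise processing equivalent to processing one long sequence.

```python
import math
import random

def phi(x):
    # Positive feature map elu(x) + 1 (an assumed, commonly used choice).
    return [v + 1.0 if v > 0 else math.exp(v) for v in x]

def init_state(d_k, d_v):
    # S accumulates phi(k) v^T; z accumulates phi(k) for normalization.
    return [[0.0] * d_v for _ in range(d_k)], [0.0] * d_k

def run_segment(qs, ks, vs, state):
    """Causal linear attention over one segment, starting from `state`."""
    S = [row[:] for row in state[0]]  # copy so the caller's state is untouched
    z = state[1][:]
    outputs = []
    for q, k, v in zip(qs, ks, vs):
        fk, fq = phi(k), phi(q)
        for i, fki in enumerate(fk):
            z[i] += fki
            for j, vj in enumerate(v):
                S[i][j] += fki * vj
        denom = sum(a * b for a, b in zip(fq, z))
        outputs.append([
            sum(fq[i] * S[i][j] for i in range(len(fq))) / denom
            for j in range(len(v))
        ])
    return outputs, (S, z)

def run_segments(qs, ks, vs, seg_len, stateful):
    """Process a long sequence segment by segment.

    stateful=True  -> memory state is carried across segment boundaries.
    stateful=False -> memory state is reinitialized at every boundary.
    """
    d_k, d_v = len(ks[0]), len(vs[0])
    state = init_state(d_k, d_v)
    outputs = []
    for start in range(0, len(qs), seg_len):
        if not stateful:
            state = init_state(d_k, d_v)
        end = start + seg_len
        out, state = run_segment(qs[start:end], ks[start:end], vs[start:end], state)
        outputs.extend(out)
    return outputs

def max_abs_diff(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

# Toy data: 12 steps, split into segments of 4 (all sizes are arbitrary).
rng = random.Random(0)
T, d_k, d_v, seg_len = 12, 3, 2, 4
qs = [[rng.uniform(-1, 1) for _ in range(d_k)] for _ in range(T)]
ks = [[rng.uniform(-1, 1) for _ in range(d_k)] for _ in range(T)]
vs = [[rng.uniform(-1, 1) for _ in range(d_v)] for _ in range(T)]

full = run_segments(qs, ks, vs, seg_len=T, stateful=True)       # one long pass
stateful = run_segments(qs, ks, vs, seg_len=seg_len, stateful=True)
stateless = run_segments(qs, ks, vs, seg_len=seg_len, stateful=False)

print("stateful vs full:", max_abs_diff(full, stateful))
print("stateless vs full (after first segment):",
      max_abs_diff(full[seg_len:], stateless[seg_len:]))
```

In this toy setup the stateful run reproduces the full-sequence outputs, while the stateless run agrees only within the first segment. After each reset it can no longer attend to anything seen before the boundary.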

Taxonomy

Topics: Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications