# Memory-Augmented Spatio-Temporal Transformer for Robust Traffic Flow Forecasting

**Authors:** Puqing Hu, Chunjiang Wu, Chen Wang, Xin Yang, Zhibin Li, Tinghui Chen, Shijie Zhou

PMC · DOI: 10.3390/biomimetics11030170 · Biomimetics · 2026-03-02

## TL;DR

This paper introduces a new model for predicting traffic flow that uses memory to better capture long-term patterns and outperforms existing methods.

## Contribution

The novel contribution is integrating a learnable memory tensor into an attention-based framework for traffic flow forecasting.

## Key findings

- The proposed model achieves superior prediction accuracy on real-world traffic datasets.
- The memory mechanism enables efficient and dynamic spatio-temporal representation learning.
- The model offers robustness and a lightweight architecture compared to existing baselines.

## Abstract

Accurate traffic flow prediction plays a critical role in intelligent transportation systems, supporting traffic management, congestion mitigation, and efficient utilization of road resources. Advances in neural network-based methods, particularly graph neural networks (GNNs) and attention-based models, have demonstrated strong capability in modeling spatio-temporal traffic dynamics. However, existing approaches still face notable challenges: GNN-based models often rely on static adjacency matrices, limiting their ability to capture dynamic and long-range spatial dependencies, while attention-based models usually involve complex architectures and heavy reliance on large-scale pre-training data. To address these limitations, this study proposes a novel traffic flow prediction model that integrates a learnable memory tensor into an attention-based framework. The introduced memory mechanism provides persistent global context for modeling long-term temporal dependencies in an end-to-end manner, enabling efficient and dynamic spatio-temporal representation learning with a lightweight architecture. Extensive experiments on multiple real-world traffic datasets demonstrate that the proposed model achieves superior prediction accuracy and robustness compared with existing baselines. The proposed approach offers a new perspective for memory-enhanced spatio-temporal modeling and provides valuable insights for traffic forecasting and related intelligent transportation applications.

## Full-text entities

- **Diseases:** injury to (MESH:D014947)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** PeMS04 — Homo sapiens (Human), Melanoma, Cancer cell line (CVCL_S856)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13023845/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13023845/full.md

## References

46 references — full list in the complete paper: https://tomesphere.com/paper/PMC13023845/full.md

---
Source: https://tomesphere.com/paper/PMC13023845