TwinFormer: A Dual-Level Transformer for Long-Sequence Time-Series Forecasting

Mahima Kumavat; Aditya Maheshwari

arXiv:2512.12301·cs.LG·December 16, 2025

TwinFormer: A Dual-Level Transformer for Long-Sequence Time-Series Forecasting

Mahima Kumavat, Aditya Maheshwari

PDF

Open Access

TL;DR

TwinFormer introduces a hierarchical Transformer architecture with local and global attention mechanisms for efficient long-sequence time-series forecasting, achieving state-of-the-art results across diverse datasets.

Contribution

The paper proposes a novel dual-level Transformer architecture that combines local sparse attention and global inter-patch modeling for improved long-term forecasting.

Findings

01

Outperforms existing models on 8 real-world datasets

02

Achieves linear time and memory complexity

03

Demonstrates superior accuracy in MAE and RMSE metrics

Abstract

TwinFormer is a hierarchical Transformer for long-sequence time-series forecasting. It divides the input into non-overlapping temporal patches and processes them in two stages: (1) a Local Informer with top- $k$ Sparse Attention models intra-patch dynamics, followed by mean pooling; (2) a Global Informer captures long-range inter-patch dependencies using the same top- $k$ attention. A lightweight GRU aggregates the globally contextualized patch tokens for direct multi-horizon prediction. The resulting architecture achieves linear $O (k L d)$ time and memory complexity. On eight real-world benchmarking datasets from six different domains, including weather, stock price, temperature, power consumption, electricity, and disease, and forecasting horizons $96 - 720$ , TwinFormer secures $27$ positions in the top two out of $34$ . Out of the $27$ , it achieves the best performance on MAE and RMSE at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic Prediction and Management Techniques · Time Series Analysis and Forecasting · Stock Market Forecasting Methods