TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context Learning
Andreas Auer, Patrick Podest, Daniel Klotz, Sebastian B\"ock, G\"unter Klambauer, Sepp Hochreiter

TL;DR
TiRex introduces an enhanced LSTM model with in-context learning capabilities, achieving state-of-the-art zero-shot time series forecasting across various horizons by combining state-tracking with a novel training masking strategy.
Contribution
The paper presents TiRex, a new LSTM-based model with improved in-context learning and state-tracking, outperforming transformer-based models in zero-shot forecasting tasks.
Findings
TiRex achieves new state-of-the-art results on HuggingFace benchmarks.
It outperforms larger models like TabPFN-TS and TimesFM.
Effective for both short- and long-term forecasting.
Abstract
In-context learning, the ability of large language models to perform tasks using only examples provided in the prompt, has recently been adapted for time series forecasting. This paradigm enables zero-shot prediction, where past values serve as context for forecasting future values, making powerful forecasting tools accessible to non-experts and increasing the performance when training data are scarce. Most existing zero-shot forecasting approaches rely on transformer architectures, which, despite their success in language, often fall short of expectations in time series forecasting, where recurrent models like LSTMs frequently have the edge. Conversely, while LSTMs are well-suited for time series modeling due to their state-tracking capabilities, they lack strong in-context learning abilities. We introduce TiRex that closes this gap by leveraging xLSTM, an enhanced LSTM with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForecasting Techniques and Applications · Traffic Prediction and Management Techniques · Stock Market Forecasting Methods
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory
