Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs
Yitong Zhou, Yucong Luo, Mingyue Cheng, Qi Liu, Jiahao Wang, Daoyu Wang, Enhong Chen

TL;DR
This paper introduces Time-R1, a reinforcement fine-tuning framework that enhances large language models' multi-step reasoning for time series forecasting, outperforming traditional methods.
Contribution
It proposes a novel two-stage reinforcement learning approach with a specialized reward and sampling strategy to improve LLMs' reasoning in TSF tasks.
Findings
Time-R1 significantly improves forecasting accuracy across various datasets.
The method enhances multi-step reasoning capabilities of LLMs for TSF.
Reinforcement fine-tuning outperforms prompt engineering alone.
Abstract
To advance time series forecasting (TSF), various methods have been proposed to improve prediction accuracy, evolving from statistical techniques to data-driven deep learning architectures. Despite their effectiveness, most existing methods still adhere to a fast thinking paradigm-relying on extracting historical patterns and mapping them to future values as their core modeling philosophy, lacking an explicit thinking process that incorporates intermediate time series reasoning. Meanwhile, emerging slow-thinking LLMs (e.g., OpenAI-o1) have shown remarkable multi-step reasoning capabilities, offering an alternative way to overcome these issues. However, prompt engineering alone presents several limitations - including high computational cost, privacy risks, and limited capacity for in-depth domain-specific time series reasoning. To address these limitations, a more promising approach is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
