TL;DR
TimeRFT introduces reinforcement learning techniques to improve the adaptation and generalization of time series foundation models across diverse forecasting tasks and data regimes.
Contribution
The paper proposes a novel reinforcement finetuning paradigm, TimeRFT, with task-specific strategies to enhance TSFMs' robustness and generalization capabilities.
Findings
TimeRFT outperforms supervised fine-tuning methods in various forecasting tasks.
It improves prediction accuracy under distribution shifts.
TimeRFT adapts well across different data availability scenarios.
Abstract
Time Series Foundation Models (TSFMs) advance generalization and data efficiency in time series forecasting by unified large-scale pretraining. But TSFMs remain lacking when adapting to specific downstream forecasting tasks for two reasons. First, the non-stationary and uncertain nature of time series data lead to inevitable temporal distribution shifts between historical training and future testing data, while current Supervised FineTuning (SFT)-based methods are prone to overfitting and may degrade generalization. Second, training data availability varies across forecasting tasks, requiring TSFMs to generalize well under diverse data regimes. To address these challenges, we introduce the Time series Reinforcement Finetuning (TimeRFT) paradigm for TSFM downstream adaptation, which consists of two task-specific training recipes: i) A forecasting quality-based temporal reward mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
