CaReTS: A Multi-Task Framework Unifying Classification and Regression for Time Series Forecasting
Fulong Yao, Wanqing Zhao, Chao Zheng, Xiaofei Han

TL;DR
CaReTS is a multi-task framework that unifies classification and regression to improve both accuracy and interpretability in multi-step time series forecasting.
Contribution
It introduces a dual-stream architecture with uncertainty-aware multi-task loss, integrating various temporal encoders for enhanced forecasting and trend classification.
Findings
Outperforms state-of-the-art forecasting models.
Achieves higher trend classification accuracy.
Provides interpretable trend and deviation predictions.
Abstract
Recent advances in deep forecasting models have achieved remarkable performance, yet most approaches still struggle to provide both accurate predictions and interpretable insights into temporal dynamics. This paper proposes CaReTS, a novel multi-task learning framework that combines classification and regression tasks for multi-step time series forecasting problems. The framework adopts a dual-stream architecture, where a classification branch learns the stepwise trend into the future, while a regression branch estimates the corresponding deviations from the latest observation of the target variable. The dual-stream design provides more interpretable predictions by disentangling macro-level trends from micro-level deviations in the target variable. To enable effective learning in output prediction, deviation estimation, and trend classification, we design a multi-task loss with…
Peer Reviews
Decision·Submitted to ICLR 2026
• 1. The authors apply the proposed architecture to various architectures, demonstrating its strong adaptability. • 2. The authors explicitly separate the regression and classification modules, which indeed enhances the model's interpretability.
• Jointly optimizing multiple tasks (e.g., regression and classification) through a shared encoder with separate task-specific heads is already a widely adopted approach across various fields[1][2] and does not represent a significant novelty. Additionally, incorporating uncertainty-based loss weighting for adaptive multi-task loss balancing is also widely adopted [3]. • [1] A Multitask Deep Learning Model for Classification and Regression of Hyperspectral Images: Application to the Large-Scale
1. Explicit trend/deviation separation provides actionable insights for energy/finance applications-a genuine practical advantage often overlooked in pure accuracy-focused papers 2. 200-400s training is substantially faster than Autoformer (>460s) and SOIT2FNN-MO (>860s) while maintaining competitive accuracy. 3. Good trend accuracy, remaining stable across forecast horizons (Figure 5) is valuable for risk management applications 4. Comprehensive ablations across encoders, single vs multi-task
1. Multi-task classification + regression is well-established; the core contribution is applying this to time series with uncertainty weighting. The architectural variants (CaReTS1-4) differ only in fusion; this feels more like hyperparameter exploration than distinct methodological contributions. 2. Narrow evaluation scope: a. Only 2 datasets (both hourly energy data) from a single domain b. Missing standard benchmarks (ETTh1/h2, Weather, Traffic, Exchange) used in TimesNet[4]/iTransformer[5]
1. Paper is clear and well presented. 2. Framework appears to be encoder agnostic and can be easily plugged in any encoder architecture. 3. Modest improvements in the performance for predictions in the two datasets considered.
The advantages of the proposed architecture are not clear based on the results in the paper. The gains in the performance are very minor compared to traditional encoder architectures to substantiate the multi-task framework. The models have been tested on two niche datasets. The paper probably needs to consider standard time series foundation model datasets like ETTh1, Weather etc to clearly demonstrate the advantages. The paper is probably better placed to be in a domain specific conference
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Forecasting Techniques and Applications · Data Stream Mining Techniques
