Selective Learning for Deep Time Series Forecasting

Yisong Fu; Zezhi Shao; Chengqing Yu; Yujie Li; Zhulin An; Qi Wang; Yongjun Xu; Fei Wang

arXiv:2510.25207·cs.LG·October 30, 2025

Selective Learning for Deep Time Series Forecasting

Yisong Fu, Zezhi Shao, Chengqing Yu, Yujie Li, Zhulin An, Qi Wang, Yongjun Xu, Fei Wang

PDF

TL;DR

This paper introduces a selective learning strategy for deep time series forecasting that improves model generalization by focusing on reliable, non-anomalous, and less uncertain timesteps, reducing overfitting.

Contribution

The paper proposes a dual-mask selective learning framework that filters uncertain and anomalous timesteps, enhancing deep TSF model performance.

Findings

01

Significant MSE reduction across multiple models and datasets

02

Improved forecasting accuracy with selective learning approach

03

Effective filtering of uncertain and anomalous data points

Abstract

Benefiting from high capacity for capturing complex temporal patterns, deep learning (DL) has significantly advanced time series forecasting (TSF). However, deep models tend to suffer from severe overfitting due to the inherent vulnerability of time series to noise and anomalies. The prevailing DL paradigm uniformly optimizes all timesteps through the MSE loss and learns those uncertain and anomalous timesteps without difference, ultimately resulting in overfitting. To address this, we propose a novel selective learning strategy for deep TSF. Specifically, selective learning screens a subset of the whole timesteps to calculate the MSE loss in optimization, guiding the model to focus on generalizable timesteps while disregarding non-generalizable ones. Our framework introduces a dual-mask mechanism to target timesteps: (1) an uncertainty mask leveraging residual entropy to filter…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.