Forecasting Multivariate Time Series under Predictive Heterogeneity: A Validation-Driven Clustering Framework

Ziling Ma; Ángel López Oriona; Hernando Ombao; Ying Sun

arXiv:2604.13748 · stat.ME · April 16, 2026

TL;DR

This paper introduces a validation-driven clustering framework for multivariate time series forecasting that adaptively determines when to specialize models based on out-of-sample predictive performance, improving accuracy and robustness.

Contribution

The paper formulates adaptive pooling as a statistical decision problem in which validation errors, rather than representation similarity, guide the clustering. A fallback to the global model makes the approach more reliable in heterogeneous forecasting tasks.
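The fallback idea can be illustrated with a minimal sketch. It assumes the decision is made per series by comparing validation losses, with a hypothetical `margin` parameter; the function name and the exact rule are illustrative assumptions, not the paper's procedure.

```python
import numpy as np

def choose_with_fallback(global_val_loss, specialized_val_loss, margin=0.0):
    """Return a boolean mask: True where the specialized model is kept.

    Assumption (not from the paper): a series uses its cluster-specific
    model only if that model beats the global model on validation data
    by more than `margin`; otherwise it falls back to the global model.
    """
    g = np.asarray(global_val_loss, dtype=float)
    s = np.asarray(specialized_val_loss, dtype=float)
    return s < g - margin

# Hypothetical validation losses for three series.
global_loss = [1.0, 0.8, 0.5]
specialized_loss = [0.7, 0.9, 0.5]
mask = choose_with_fallback(global_loss, specialized_loss)
print(mask)  # series 0 specializes; series 1 and 2 fall back to the global model
```

A strictly positive `margin` would make the rule more conservative, which is one plausible way to limit negative transfer when the validation gain is within noise.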

Findings

01. Consistent improvements over strong baselines on traffic datasets.

02. Robustness to heavy-tailed errors and local anomalies.

03. Effective avoidance of negative transfer when heterogeneity is weak.

Abstract

We study adaptive pooling under predictive heterogeneity in high-dimensional multivariate time series forecasting, where global models improve statistical efficiency but may fail to capture heterogeneous predictive structure, while naive specialization can induce negative transfer. We formulate adaptive pooling as a statistical decision problem and propose a validation-driven framework that determines when and how specialization should be applied. Rather than grouping series based on representation similarity, we define partitions through out-of-sample predictive performance, thereby aligning data organization with predictive risk, defined as expected out-of-sample loss and approximated via validation error. Cluster assignments are iteratively updated using validation losses for both point (Huber) and probabilistic (pinball) forecasting, improving robustness to heavy-tailed errors and…
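To make the two validation losses and the assignment step concrete, here is a minimal sketch. The Huber and pinball losses follow their standard definitions; the `reassign` helper, the array shapes, and the default `delta` and `q` values are assumptions for illustration and are not taken from the paper.

```python
import numpy as np

def huber_loss(y_true, y_pred, delta=1.0):
    """Mean Huber loss over the last axis: quadratic near zero, linear in the tails."""
    r = np.abs(np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float))
    quadratic = 0.5 * r ** 2
    linear = delta * (r - 0.5 * delta)
    return np.where(r <= delta, quadratic, linear).mean(axis=-1)

def pinball_loss(y_true, y_pred, q=0.5):
    """Mean pinball (quantile) loss over the last axis for quantile level q."""
    d = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return np.maximum(q * d, (q - 1.0) * d).mean(axis=-1)

def reassign(y_val, preds_by_cluster, loss_fn):
    """Assign each series to the cluster model with the lowest validation loss.

    y_val:            (n_series, horizon) validation targets
    preds_by_cluster: (n_clusters, n_series, horizon) forecasts, one slice per
                      cluster-specific model (hypothetical layout)
    """
    losses = np.stack([loss_fn(y_val, p) for p in preds_by_cluster])
    return losses.argmin(axis=0), losses

# Toy example: two series, two cluster models, horizon of two steps.
y_val = np.array([[0.0, 0.0], [10.0, 10.0]])
preds = np.stack([np.zeros_like(y_val), np.full_like(y_val, 10.0)])
labels, losses = reassign(y_val, preds, huber_loss)
print(labels)
```

In an iterative scheme of the kind the abstract describes, a step like this would alternate with refitting each cluster's model on its current members; that refitting loop is omitted here.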
