DeRegiME: Deep Regime Mixtures for Probabilistic Forecasting under Distribution Shift

Kieran Wood; Stefan Zohren; Stephen J. Roberts

arXiv:2605.19231·cs.LG·May 20, 2026

DeRegiME: Deep Regime Mixtures for Probabilistic Forecasting under Distribution Shift

Kieran Wood, Stefan Zohren, Stephen J. Roberts

PDF

TL;DR

DeRegiME is a probabilistic time series forecasting model that identifies and leverages latent regimes to improve predictive density and uncertainty quantification under distribution shifts.

Contribution

It introduces a novel deep mixture of experts model with a sparse Gaussian process that explicitly models regime changes and residual uncertainty.

Findings

01

DeRegiME improves negative log predictive density by 20.3% over baselines.

02

It achieves consistent gains across diverse datasets with distribution shifts.

03

The model provides an interpretable decomposition of mean, residual, and noise components.

Abstract

We introduce DeRegiME -- Deep Regime Mixture of Experts -- a direct multi-horizon probabilistic forecaster that separates latent uncertainty regimes from the underlying signal and softly assigns each forecast location to learned recurring regimes using a sparse variational Gaussian process (GP) whose nonstationary regime-mixing kernel and Student-t likelihood combine per-regime sub-kernels and noise processes via a shared gate. This yields a single sparse-GP posterior, not a mixture of GP experts. DeRegiME addresses a key limitation of neural forecasters: point forecasts discard residual uncertainty, and probabilistic heads -- whether single marginals, uninterpreted mixtures, quantile sets, or diffusion samples -- rarely expose the regime structure of the residual. Yet distribution shift in noisy heteroskedastic time series may be abrupt, gradual, or horizon-dependent and often appears…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.