DeRegiME: Deep Regime Mixtures for Probabilistic Forecasting under Distribution Shift
Kieran Wood, Stefan Zohren, Stephen J. Roberts

TL;DR
DeRegiME is a probabilistic time series forecasting model that identifies and leverages latent regimes to improve predictive density and uncertainty quantification under distribution shifts.
Contribution
It introduces a novel deep mixture of experts model with a sparse Gaussian process that explicitly models regime changes and residual uncertainty.
Findings
DeRegiME improves negative log predictive density by 20.3% over baselines.
It achieves consistent gains across diverse datasets with distribution shifts.
The model provides an interpretable decomposition of mean, residual, and noise components.
Abstract
We introduce DeRegiME -- Deep Regime Mixture of Experts -- a direct multi-horizon probabilistic forecaster that separates latent uncertainty regimes from the underlying signal and softly assigns each forecast location to learned recurring regimes using a sparse variational Gaussian process (GP) whose nonstationary regime-mixing kernel and Student-t likelihood combine per-regime sub-kernels and noise processes via a shared gate. This yields a single sparse-GP posterior, not a mixture of GP experts. DeRegiME addresses a key limitation of neural forecasters: point forecasts discard residual uncertainty, and probabilistic heads -- whether single marginals, uninterpreted mixtures, quantile sets, or diffusion samples -- rarely expose the regime structure of the residual. Yet distribution shift in noisy heteroskedastic time series may be abrupt, gradual, or horizon-dependent and often appears…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
