Enhancing Transformer-Based Foundation Models for Time Series Forecasting via Bagging, Boosting and Statistical Ensembles
Dhruv D. Modi, Rong Pan

TL;DR
This paper enhances transformer-based time series foundation models by integrating ensemble and statistical techniques, significantly improving their robustness, accuracy, and uncertainty quantification in real-world forecasting scenarios.
Contribution
It introduces a suite of ensemble and statistical methods to improve TSFMs, demonstrating consistent performance gains over standalone models on a real dataset.
Findings
Regression ensembles achieve lowest mean squared error.
Bootstrap aggregation reduces long-context errors.
Residual modeling corrects systematic bias.
Abstract
Time series foundation models (TSFMs) such as Lag-Llama, TimeGPT, Chronos, MOMENT, UniTS, and TimesFM have shown strong generalization and zero-shot capabilities for time series forecasting, anomaly detection, classification, and imputation. Despite these advantages, their predictions still suffer from variance, domain-specific bias, and limited uncertainty quantification when deployed on real operational data. This paper investigates a suite of statistical and ensemble-based enhancement techniques, including bootstrap-based bagging, regression-based stacking, prediction interval construction, statistical residual modeling, and iterative error feedback, to improve robustness and accuracy. Using the Belgium Electricity Short-Term Load Forecasting dataset as a case study, we demonstrate that the proposed hybrids consistently outperform standalone foundation models across multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
