Billions-Scale Forecast Reconciliation
Tianyu Wang, Matthew C. Johnson, Steven Klee, Matthew L. Malloy

TL;DR
This paper addresses large-scale forecast reconciliation using optimization, demonstrating solutions for problems with billions of values, and establishing theoretical links to share-based methods for certain cases.
Contribution
It introduces scalable algorithms for forecast reconciliation at unprecedented data scales and formalizes the connection between least-squares and share-based reconciliation methods.
Findings
Successfully reconciled over four billion forecasted values.
Demonstrated the largest constrained least-squares problem solved to date.
Established theoretical equivalence between least-squares and share-based reconciliation under specific conditions.
Abstract
The problem of combining multiple forecasts of related quantities that obey expected equality and additivity constraints, often referred to a hierarchical forecast reconciliation, is naturally stated as a simple optimization problem. In this paper we explore optimization-based point forecast reconciliation at scales faced by large retailers. We implement and benchmark several algorithms to solve the forecast reconciliation problem, showing efficacy when the dimension of the problem exceeds four billion forecasted values. To the best of our knowledge, this is the largest forecast reconciliation problem, and perhaps on-par with the largest constrained least-squares-problem ever solved. We also make several theoretical contributions. We show that for a restricted class of problems and when the loss function is weighted appropriately, least-squares forecast reconciliation is equivalent to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForecasting Techniques and Applications · Financial Risk and Volatility Modeling · Imbalanced Data Classification Techniques
