The Forecast Critic: Leveraging Large Language Models for Poor Forecast Identification
Luke Bhan, Hanyu Zhang, Andrew Gordon Wilson, Michael W. Mahoney, Chuck Arvin

TL;DR
This paper explores how large language models can be used to automatically monitor and critique forecast quality in retail, demonstrating promising results in identifying errors and incorporating contextual information without domain-specific training.
Contribution
It systematically evaluates LLMs for forecast monitoring, showing their ability to detect forecast errors and incorporate unstructured data, advancing automated forecast evaluation methods.
Findings
LLMs can reliably detect forecast errors with an F1 score of 0.88.
Multi-modal LLMs effectively incorporate contextual signals, achieving an F1 score of 0.84.
Techniques work on real-world retail data, identifying unreasonable forecasts with at least 10% higher sCRPS.
Abstract
Monitoring forecasting systems is critical for customer satisfaction, profitability, and operational efficiency in large-scale retail businesses. We propose The Forecast Critic, a system that leverages Large Language Models (LLMs) for automated forecast monitoring, taking advantage of their broad world knowledge and strong ``reasoning'' capabilities. As a prerequisite for this, we systematically evaluate the ability of LLMs to assess time series forecast quality, focusing on three key questions. (1) Can LLMs be deployed to perform forecast monitoring and identify obviously unreasonable forecasts? (2) Can LLMs effectively incorporate unstructured exogenous features to assess what a reasonable forecast looks like? (3) How does performance vary across model sizes and reasoning capabilities, measured across state-of-the-art LLMs? We present three experiments, including on both synthetic and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsForecasting Techniques and Applications · Stock Market Forecasting Methods · Traffic Prediction and Management Techniques
