Beyond Na\"ive Prompting: Strategies for Improved Context-aided Forecasting with LLMs
Arjun Ashok, Andrew Robert Williams, Vincent Zhihao Zheng, Irina Rish, Nicolas Chapados, \'Etienne Marcotte, Valentina Zantedeschi, Alexandre Drouin

TL;DR
This paper introduces a comprehensive framework with four strategies to enhance context-aided forecasting using large language models, addressing diagnostics, accuracy, and efficiency to improve practical deployment.
Contribution
It proposes a unified set of strategies for diagnosing, improving, and optimizing LLM-based forecasting, filling gaps in understanding and efficiency.
Findings
Diagnostic strategies reveal the 'Execution Gap' in model reasoning.
Accuracy improvements of 25-50% through targeted strategies.
Adaptive routing reduces inference costs while maintaining accuracy.
Abstract
Real-world forecasting requires models to integrate not only historical data but also relevant contextual information provided in textual form. While large language models (LLMs) show promise for context-aided forecasting, critical challenges remain: we lack diagnostic tools to understand failure modes, performance remains far below their potential, and high computational costs limit practical deployment. We introduce a unified framework of four strategies that address these limitations along three orthogonal dimensions: model diagnostics, accuracy, and efficiency. Through extensive evaluation across model families from small open-source models to frontier models including Gemini, GPT, and Claude, we uncover both fundamental insights and practical solutions. Our findings span three key dimensions: diagnostic strategies reveal the "Execution Gap" where models correctly explain how…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
