Beyond Na\"ive Prompting: Strategies for Improved Context-aided Forecasting with LLMs

Arjun Ashok; Andrew Robert Williams; Vincent Zhihao Zheng; Irina Rish; Nicolas Chapados; \'Etienne Marcotte; Valentina Zantedeschi; Alexandre Drouin

arXiv:2508.09904·cs.LG·March 2, 2026

Beyond Na\"ive Prompting: Strategies for Improved Context-aided Forecasting with LLMs

Arjun Ashok, Andrew Robert Williams, Vincent Zhihao Zheng, Irina Rish, Nicolas Chapados, \'Etienne Marcotte, Valentina Zantedeschi, Alexandre Drouin

PDF

TL;DR

This paper introduces a comprehensive framework with four strategies to enhance context-aided forecasting using large language models, addressing diagnostics, accuracy, and efficiency to improve practical deployment.

Contribution

It proposes a unified set of strategies for diagnosing, improving, and optimizing LLM-based forecasting, filling gaps in understanding and efficiency.

Findings

01

Diagnostic strategies reveal the 'Execution Gap' in model reasoning.

02

Accuracy improvements of 25-50% through targeted strategies.

03

Adaptive routing reduces inference costs while maintaining accuracy.

Abstract

Real-world forecasting requires models to integrate not only historical data but also relevant contextual information provided in textual form. While large language models (LLMs) show promise for context-aided forecasting, critical challenges remain: we lack diagnostic tools to understand failure modes, performance remains far below their potential, and high computational costs limit practical deployment. We introduce a unified framework of four strategies that address these limitations along three orthogonal dimensions: model diagnostics, accuracy, and efficiency. Through extensive evaluation across model families from small open-source models to frontier models including Gemini, GPT, and Claude, we uncover both fundamental insights and practical solutions. Our findings span three key dimensions: diagnostic strategies reveal the "Execution Gap" where models correctly explain how…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.