AI-Augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy
Philipp Schoenegger, Peter S. Park, Ezra Karger, Sean Trott, Philip E., Tetlock

TL;DR
This study demonstrates that large language model assistants significantly improve human forecasting accuracy, especially when providing high-quality advice, with potential implications for decision-making and cognitive augmentation.
Contribution
It is the first to empirically evaluate the impact of different types of LLM assistants on human forecasting accuracy in a large-scale experiment.
Findings
LLM assistants increased prediction accuracy by 24-28%.
Superforecasting LLM improved accuracy by 41% with outliers included.
No consistent evidence that benefits vary by forecaster skill or question difficulty.
Abstract
Large language models (LLMs) match and sometimes exceeding human performance in many domains. This study explores the potential of LLMs to augment human judgement in a forecasting task. We evaluate the effect on human forecasters of two LLM assistants: one designed to provide high-quality ("superforecasting") advice, and the other designed to be overconfident and base-rate neglecting, thus providing noisy forecasting advice. We compare participants using these assistants to a control group that received a less advanced model that did not provide numerical predictions or engaged in explicit discussion of predictions. Participants (N = 991) answered a set of six forecasting questions and had the option to consult their assigned LLM assistant throughout. Our preregistered analyses show that interacting with each of our frontier LLM assistants significantly enhances prediction accuracy by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForecasting Techniques and Applications
MethodsSparse Evolutionary Training
