AI-Augmented Predictions: LLM Assistants Improve Human Forecasting   Accuracy

Philipp Schoenegger; Peter S. Park; Ezra Karger; Sean Trott; Philip E.; Tetlock

arXiv:2402.07862·cs.CY·August 23, 2024·3 cites

AI-Augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy

Philipp Schoenegger, Peter S. Park, Ezra Karger, Sean Trott, Philip E., Tetlock

PDF

Open Access

TL;DR

This study demonstrates that large language model assistants significantly improve human forecasting accuracy, especially when providing high-quality advice, with potential implications for decision-making and cognitive augmentation.

Contribution

It is the first to empirically evaluate the impact of different types of LLM assistants on human forecasting accuracy in a large-scale experiment.

Findings

01

LLM assistants increased prediction accuracy by 24-28%.

02

Superforecasting LLM improved accuracy by 41% with outliers included.

03

No consistent evidence that benefits vary by forecaster skill or question difficulty.

Abstract

Large language models (LLMs) match and sometimes exceeding human performance in many domains. This study explores the potential of LLMs to augment human judgement in a forecasting task. We evaluate the effect on human forecasters of two LLM assistants: one designed to provide high-quality ("superforecasting") advice, and the other designed to be overconfident and base-rate neglecting, thus providing noisy forecasting advice. We compare participants using these assistants to a control group that received a less advanced model that did not provide numerical predictions or engaged in explicit discussion of predictions. Participants (N = 991) answered a set of six forecasting questions and had the option to consult their assigned LLM assistant throughout. Our preregistered analyses show that interacting with each of our frontier LLM assistants significantly enhances prediction accuracy by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsForecasting Techniques and Applications

MethodsSparse Evolutionary Training