Navigating Tomorrow: Reliably Assessing Large Language Models   Performance on Future Event Prediction

Petraq Nako; Adam Jatowt

arXiv:2501.05925·cs.CL·January 13, 2025

Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction

Petraq Nako, Adam Jatowt

PDF

Open Access

TL;DR

This paper evaluates the ability of large language models to predict future events across various scenarios, highlighting their potential and limitations in predictive tasks.

Contribution

It introduces a new dataset and methodology for assessing LLMs in future event prediction, an under-explored application area.

Findings

01

LLMs show potential in future event prediction tasks.

02

Performance varies across different question types and scenarios.

03

Limitations identified suggest areas for future model improvements.

Abstract

Predicting future events is an important activity with applications across multiple fields and domains. For example, the capacity to foresee stock market trends, natural disasters, business developments, or political events can facilitate early preventive measures and uncover new opportunities. Multiple diverse computational methods for attempting future predictions, including predictive analysis, time series forecasting, and simulations have been proposed. This study evaluates the performance of several large language models (LLMs) in supporting future prediction tasks, an under-explored domain. We assess the models across three scenarios: Affirmative vs. Likelihood questioning, Reasoning, and Counterfactual analysis. For this, we create a dataset1 by finding and categorizing news articles based on entity type and its popularity. We gather news articles before and after the LLMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Data Quality and Management