What Are the Odds? Language Models Are Capable of Probabilistic   Reasoning

Akshay Paruchuri; Jake Garrison; Shun Liao; John Hernandez; Jacob; Sunshine; Tim Althoff; Xin Liu; Daniel McDuff

arXiv:2406.12830·cs.CL·October 1, 2024

What Are the Odds? Language Models Are Capable of Probabilistic Reasoning

Akshay Paruchuri, Jake Garrison, Shun Liao, John Hernandez, Jacob, Sunshine, Tim Althoff, Xin Liu, Daniel McDuff

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper evaluates the probabilistic reasoning abilities of language models across various tasks and contexts, revealing their capacity to infer and work with probability distributions, especially when aided by context and assumptions.

Contribution

It introduces a systematic evaluation framework and a new benchmark dataset for assessing LMs' probabilistic reasoning skills across multiple tasks.

Findings

01

Models can infer distribution properties with proper context.

02

Real-world context improves probabilistic reasoning.

03

Simplified assumptions can aid inference even if incorrect.

Abstract

Language models (LM) are capable of remarkably complex linguistic tasks; however, numerical reasoning is an area in which they frequently struggle. An important but rarely evaluated form of reasoning is understanding probability distributions. In this paper, we focus on evaluating the probabilistic reasoning capabilities of LMs using idealized and real-world statistical distributions. We perform a systematic evaluation of state-of-the-art LMs on three tasks: estimating percentiles, drawing samples, and calculating probabilities. We evaluate three ways to provide context to LMs 1) anchoring examples from within a distribution or family of distributions, 2) real-world context, 3) summary statistics on which to base a Normal approximation. Models can make inferences about distributions, and can be further aided by the incorporation of real-world context, example shots and simplified…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yahskapar/LLMs-and-Probabilistic-Reasoning
noneOfficial

Videos

What Are the Odds? Language Models Are Capable of Probabilistic Reasoning· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsBalanced Selection · Focus