Reasoning over Uncertain Text by Generative Large Language Models

Aliakbar Nafar; Kristen Brent Venable; Parisa Kordjamshidi

arXiv:2402.09614·cs.CL·December 30, 2024·2 cites

Reasoning over Uncertain Text by Generative Large Language Models

Aliakbar Nafar, Kristen Brent Venable, Parisa Kordjamshidi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces BLInD, a new dataset for testing probabilistic reasoning in LLMs, and evaluates prompting strategies that improve their reasoning over uncertain text.

Contribution

The paper presents BLInD, a specialized dataset for probabilistic reasoning, and proposes prompting methods that enhance LLMs' reasoning capabilities over uncertain information.

Findings

01

Prompting strategies improve LLM performance on probabilistic reasoning tasks.

02

BLInD reveals significant limitations of current LLMs in handling uncertainty.

03

Methods are effective across multiple LLM architectures.

Abstract

This paper considers the challenges Large Language Models (LLMs) face when reasoning over text that includes information involving uncertainty explicitly quantified via probability values. This type of reasoning is relevant to a variety of contexts ranging from everyday conversations to medical decision-making. Despite improvements in the mathematical reasoning capabilities of LLMs, they still exhibit significant difficulties when it comes to probabilistic reasoning. To deal with this problem, we introduce the Bayesian Linguistic Inference Dataset (BLInD), a new dataset specifically designed to test the probabilistic reasoning capabilities of LLMs. We use BLInD to find out the limitations of LLMs for tasks involving probabilistic reasoning. In addition, we present several prompting strategies that map the problem to different formal representations, including Python code, probabilistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hlr/blind
noneOfficial

Videos

Reasoning over Uncertain Text by Generative Large Language Models· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling