Belief Revision: The Adaptability of Large Language Models Reasoning

Bryan Wilie; Samuel Cahyawijaya; Etsuko Ishii; Junxian He; Pascale; Fung

arXiv:2406.19764·cs.CL·October 18, 2024

Belief Revision: The Adaptability of Large Language Models Reasoning

Bryan Wilie, Samuel Cahyawijaya, Etsuko Ishii, Junxian He, Pascale, Fung

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Belief-R, a dataset to evaluate large language models' ability to revise beliefs with new evidence, revealing current limitations and trade-offs in their reasoning adaptability.

Contribution

The paper presents Belief-R and the delta reasoning framework, providing a new benchmark for assessing and understanding LMs' belief revision capabilities.

Findings

01

Most LMs struggle with belief revision tasks.

02

Models that update well often underperform without updates.

03

Significant trade-offs exist between updating and maintaining beliefs.

Abstract

The capability to reason from text is crucial for real-world NLP applications. Real-world scenarios often involve incomplete or evolving data. In response, individuals update their beliefs and understandings accordingly. However, most existing evaluations assume that language models (LMs) operate with consistent information. We introduce Belief-R, a new dataset designed to test LMs' belief revision ability when presented with new evidence. Inspired by how humans suppress prior inferences, this task assesses LMs within the newly proposed delta reasoning ( $Δ R$ ) framework. Belief-R features sequences of premises designed to simulate scenarios where additional information could necessitate prior conclusions drawn by LMs. We evaluate $\sim$ 30 LMs across diverse prompting strategies and found that LMs generally struggle to appropriately revise their beliefs in response to new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hltchkust/belief-revision
noneOfficial

Videos

Belief Revision: The Adaptability of Large Language Models Reasoning· underline

Taxonomy

TopicsBayesian Modeling and Causal Inference · Topic Modeling · Explainable Artificial Intelligence (XAI)