Standards for Belief Representations in LLMs

Daniel A. Herrmann; Benjamin A. Levinstein

arXiv:2405.21030 · cs.AI · March 17, 2025

Open Access

TL;DR

This paper proposes four theoretical criteria (accuracy, coherence, uniformity, and use) for evaluating whether large language models internally represent beliefs, aiming to establish a foundational framework for belief measurement in LLMs.

Contribution

It introduces a unified theoretical framework with four criteria to assess belief-like representations in LLMs, bridging philosophy, decision theory, and machine learning.

Findings

01. Empirical evidence shows the limits of applying any single criterion in isolation to identify beliefs.

02. The proposed criteria balance theoretical considerations with practical constraints.

03. The framework lays groundwork for standardized belief measurement in LLM research.

Abstract

As large language models (LLMs) continue to demonstrate remarkable abilities across various domains, computer scientists are developing methods to understand their cognitive processes, particularly concerning how (and if) LLMs internally represent their beliefs about the world. However, this field currently lacks a unified theoretical foundation to underpin the study of belief in LLMs. This article begins filling this gap by proposing adequacy conditions for a representation in an LLM to count as belief-like. We argue that, while the project of belief measurement in LLMs shares striking features with belief measurement as carried out in decision theory and formal epistemology, it also differs in ways that should change how we measure belief. Thus, drawing from insights in philosophy and contemporary practices of machine learning, we establish four criteria that balance theoretical…

Taxonomy

Topics: Semantic Web and Ontologies · Data Quality and Management · Natural Language Processing Techniques