On the attribution of confidence to large language models

Geoff Keeling; Winnie Street

arXiv:2407.08388 · cs.AI · April 15, 2025

Open Access

TL;DR

This paper explores the conceptual and philosophical foundations of attributing confidence levels to large language models, questioning the interpretation, existence, and evaluation of LLM credences.

Contribution

It defends a literal interpretation of LLM credence attributions, argues that the existence of LLM credences is plausible though not conclusively established, and raises sceptical concerns about current methods for evaluating them.

Findings

01. LLM credence attributions are, at least in general, correctly interpreted literally, as expressing scientists' truth-apt beliefs about LLM credences.

02. The existence of LLM credences is plausible, but current evidence is inconclusive.

03. Even if LLMs have credences, current evaluation techniques may not track them reliably.

Abstract

Credences are mental states corresponding to degrees of confidence in propositions. Attribution of credences to Large Language Models (LLMs) is commonplace in the empirical literature on LLM evaluation. Yet the theoretical basis for LLM credence attribution is unclear. We defend three claims. First, our semantic claim is that LLM credence attributions are (at least in general) correctly interpreted literally, as expressing truth-apt beliefs on the part of scientists that purport to describe facts about LLM credences. Second, our metaphysical claim is that the existence of LLM credences is at least plausible, although current evidence is inconclusive. Third, our epistemic claim is that LLM credence attributions made in the empirical literature on LLM evaluation are subject to non-trivial sceptical concerns. It is a distinct possibility that even if LLMs have credences, LLM credence…

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling