Recovering Event Probabilities from Large Language Model Embeddings via Axiomatic Constraints

Jian-Qiao Zhu; Haijiang Yan; Thomas L. Griffiths

arXiv:2505.07883·cs.CL·May 14, 2025

Recovering Event Probabilities from Large Language Model Embeddings via Axiomatic Constraints

Jian-Qiao Zhu, Haijiang Yan, Thomas L. Griffiths

PDF

TL;DR

This paper introduces a method to recover coherent event probabilities from large language model embeddings by enforcing probability axioms in a latent space, improving the coherence and accuracy of the estimated probabilities.

Contribution

The authors propose a novel approach using a variational autoencoder with axiomatic constraints to derive more coherent and accurate event probabilities from LLM embeddings.

Findings

01

Recovered probabilities are more coherent than direct model outputs.

02

Estimated probabilities closely match true probabilities in experiments.

03

Method improves the reliability of uncertainty estimates in LLMs.

Abstract

Rational decision-making under uncertainty requires coherent degrees of belief in events. However, event probabilities generated by Large Language Models (LLMs) have been shown to exhibit incoherence, violating the axioms of probability theory. This raises the question of whether coherent event probabilities can be recovered from the embeddings used by the models. If so, those derived probabilities could be used as more accurate estimates in events involving uncertainty. To explore this question, we propose enforcing axiomatic constraints, such as the additive rule of probability theory, in the latent space learned by an extended variational autoencoder (VAE) applied to LLM embeddings. This approach enables event probabilities to naturally emerge in the latent space as the VAE learns to both reconstruct the original embeddings and predict the embeddings of semantically related events.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsALIGN