Recovering Event Probabilities from Large Language Model Embeddings via Axiomatic Constraints
Jian-Qiao Zhu, Haijiang Yan, Thomas L. Griffiths

TL;DR
This paper introduces a method to recover coherent event probabilities from large language model embeddings by enforcing probability axioms in a latent space, improving the coherence and accuracy of the estimated probabilities.
Contribution
The authors propose a novel approach using a variational autoencoder with axiomatic constraints to derive more coherent and accurate event probabilities from LLM embeddings.
Findings
Recovered probabilities are more coherent than direct model outputs.
Estimated probabilities closely match true probabilities in experiments.
Method improves the reliability of uncertainty estimates in LLMs.
Abstract
Rational decision-making under uncertainty requires coherent degrees of belief in events. However, event probabilities generated by Large Language Models (LLMs) have been shown to exhibit incoherence, violating the axioms of probability theory. This raises the question of whether coherent event probabilities can be recovered from the embeddings used by the models. If so, those derived probabilities could be used as more accurate estimates in events involving uncertainty. To explore this question, we propose enforcing axiomatic constraints, such as the additive rule of probability theory, in the latent space learned by an extended variational autoencoder (VAE) applied to LLM embeddings. This approach enables event probabilities to naturally emerge in the latent space as the VAE learns to both reconstruct the original embeddings and predict the embeddings of semantically related events.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsALIGN
