A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions

Ola Shorinwa; Zhiting Mei; Justin Lidard; Allen Z. Ren; and Anirudha Majumdar

arXiv:2412.05563·cs.CL·July 3, 2025·3 cites

A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions

Ola Shorinwa, Zhiting Mei, Justin Lidard, Allen Z. Ren, and Anirudha Majumdar

PDF

Open Access

TL;DR

This survey reviews uncertainty quantification methods for large language models, categorizing existing approaches, discussing their applications, and highlighting open challenges to improve trustworthiness and reliability.

Contribution

It provides a comprehensive taxonomy of uncertainty quantification techniques for LLMs and discusses their applications and open research challenges.

Findings

01

Existing methods are categorized within a unified taxonomy.

02

Uncertainty quantification can help detect hallucinations in LLM outputs.

03

Open challenges include developing more accurate and scalable uncertainty measures.

Abstract

The remarkable performance of large language models (LLMs) in content generation, coding, and common-sense reasoning has spurred widespread integration into many facets of society. However, integration of LLMs raises valid questions on their reliability and trustworthiness, given their propensity to generate hallucinations: plausible, factually-incorrect responses, which are expressed with striking confidence. Previous work has shown that hallucinations and other non-factual responses generated by LLMs can be detected by examining the uncertainty of the LLM in its response to the pertinent prompt, driving significant research efforts devoted to quantifying the uncertainty of LLMs. This survey seeks to provide an extensive review of existing uncertainty quantification methods for LLMs, identifying their salient features, along with their strengths and weaknesses. We present existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling