Rethinking the Uncertainty: A Critical Review and Analysis in the Era of   Large Language Models

Mohammad Beigi; Sijia Wang; Ying Shen; Zihao Lin; Adithya Kulkarni,; Jianfeng He; Feng Chen; Ming Jin; Jin-Hee Cho; Dawei Zhou; Chang-Tien Lu,; Lifu Huang

arXiv:2410.20199·cs.AI·October 29, 2024

Rethinking the Uncertainty: A Critical Review and Analysis in the Era of Large Language Models

Mohammad Beigi, Sijia Wang, Ying Shen, Zihao Lin, Adithya Kulkarni,, Jianfeng He, Feng Chen, Ming Jin, Jin-Hee Cho, Dawei Zhou, Chang-Tien Lu,, Lifu Huang

PDF

Open Access

TL;DR

This paper critically reviews uncertainty estimation in Large Language Models, proposing a comprehensive framework to categorize and understand different uncertainty types to improve reliability and safety in AI applications.

Contribution

It introduces a novel framework for identifying and understanding uncertainties in LLMs, addressing limitations of current methods and guiding future research.

Findings

01

Current uncertainty estimation methods often misidentify sources of uncertainty

02

The framework categorizes uncertainties specific to LLMs

03

Insights into improving safety-critical applications

Abstract

In recent years, Large Language Models (LLMs) have become fundamental to a broad spectrum of artificial intelligence applications. As the use of LLMs expands, precisely estimating the uncertainty in their predictions has become crucial. Current methods often struggle to accurately identify, measure, and address the true uncertainty, with many focusing primarily on estimating model confidence. This discrepancy is largely due to an incomplete understanding of where, when, and how uncertainties are injected into models. This paper introduces a comprehensive framework specifically designed to identify and understand the types and sources of uncertainty, aligned with the unique characteristics of LLMs. Our framework enhances the understanding of the diverse landscape of uncertainties by systematically categorizing and defining each type, establishing a solid foundation for developing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques