Uncertainty Quantification in Large Language Models Through Convex Hull   Analysis

Ferhat Ozgur Catak; Murat Kuzlu

arXiv:2406.19712·cs.AI·July 1, 2024·2 cites

Uncertainty Quantification in Large Language Models Through Convex Hull Analysis

Ferhat Ozgur Catak, Murat Kuzlu

PDF

Open Access

TL;DR

This paper introduces a geometric method using convex hull analysis to quantify uncertainty in large language models by analyzing response embeddings and their dispersion across different prompt complexities and settings.

Contribution

It presents a novel approach leveraging convex hulls and clustering of embeddings to measure uncertainty in LLM outputs, addressing limitations of traditional probabilistic methods.

Findings

01

Uncertainty varies with prompt complexity, model, and temperature.

02

Convex hull analysis effectively captures output variability.

03

Embedding dispersion correlates with response confidence.

Abstract

Uncertainty quantification approaches have been more critical in large language models (LLMs), particularly high-risk applications requiring reliable outputs. However, traditional methods for uncertainty quantification, such as probabilistic models and ensemble techniques, face challenges when applied to the complex and high-dimensional nature of LLM-generated outputs. This study proposes a novel geometric approach to uncertainty quantification using convex hull analysis. The proposed method leverages the spatial properties of response embeddings to measure the dispersion and variability of model outputs. The prompts are categorized into three types, i.e., `easy', `moderate', and `confusing', to generate multiple responses using different LLMs at varying temperature settings. The responses are transformed into high-dimensional embeddings via a BERT model and subsequently projected into…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dense Connections · Weight Decay · Residual Connection · Multi-Head Attention · WordPiece · Softmax · Layer Normalization