Uncertainty Quantification in Large Language Models Through Convex Hull Analysis
Ferhat Ozgur Catak, Murat Kuzlu

TL;DR
This paper introduces a geometric method using convex hull analysis to quantify uncertainty in large language models by analyzing response embeddings and their dispersion across different prompt complexities and settings.
Contribution
It presents a novel approach leveraging convex hulls and clustering of embeddings to measure uncertainty in LLM outputs, addressing limitations of traditional probabilistic methods.
Findings
Uncertainty varies with prompt complexity, model, and temperature.
Convex hull analysis effectively captures output variability.
Embedding dispersion correlates with response confidence.
Abstract
Uncertainty quantification approaches have been more critical in large language models (LLMs), particularly high-risk applications requiring reliable outputs. However, traditional methods for uncertainty quantification, such as probabilistic models and ensemble techniques, face challenges when applied to the complex and high-dimensional nature of LLM-generated outputs. This study proposes a novel geometric approach to uncertainty quantification using convex hull analysis. The proposed method leverages the spatial properties of response embeddings to measure the dispersion and variability of model outputs. The prompts are categorized into three types, i.e., `easy', `moderate', and `confusing', to generate multiple responses using different LLMs at varying temperature settings. The responses are transformed into high-dimensional embeddings via a BERT model and subsequently projected into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dense Connections · Weight Decay · Residual Connection · Multi-Head Attention · WordPiece · Softmax · Layer Normalization
