TL;DR
The paper introduces Radial Dispersion Score (RDS), a simple, model-agnostic uncertainty metric for large language models that measures semantic variability via geometric dispersion in embedding space, outperforming existing methods.
Contribution
It proposes RDS, a novel, training-free uncertainty measure based on radial dispersion, which is scalable, effective, and extends to per-sample uncertainty estimation.
Findings
RDS outperforms nine recent state-of-the-art baselines in hallucination detection.
RDS remains robust and scalable across different sample sizes and embedding choices.
Code for RDS is publicly available at https://github.com/manhitv/RDS.
Abstract
Detecting uncertainty in large language models (LLMs) is essential for building reliable systems, yet many existing approaches are overly complex and depend on brittle semantic clustering or access to model internals. We introduce Radial Dispersion Score (RDS), a simple, training-free, fully model-agnostic uncertainty metric that measures the radial dispersion of sampled generations in embedding space. Specifically, given sampled generations embedded on the unit hypersphere, RDS computes the total l1 distance from the empirical centroid, i.e., the mean embedding, providing a direct geometric signal of semantic variability. A lightweight probability-weighted variant further incorporates the model's own token probabilities when available, outperforming nine recent state-of-the-art baselines. Moreover, RDS naturally extends to effective per-sample uncertainty estimates that complement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
