Lowest Span Confidence: A Zero-Shot Metric for Efficient and Black-Box Hallucination Detection in LLMs

Yitong Qiao; Licheng Pan; Yu Mi; Lei Liu; Yue Shen; Fei Sun; Zhixuan Chu

arXiv:2601.19918·cs.CL·January 29, 2026

Lowest Span Confidence: A Zero-Shot Metric for Efficient and Black-Box Hallucination Detection in LLMs

Yitong Qiao, Licheng Pan, Yu Mi, Lei Liu, Yue Shen, Fei Sun, Zhixuan Chu

PDF

Open Access

TL;DR

This paper introduces Lowest Span Confidence (LSC), a zero-shot, resource-efficient metric that detects hallucinations in large language models by analyzing local span likelihoods, outperforming existing methods across various benchmarks.

Contribution

The paper proposes LSC, a novel zero-shot hallucination detection metric that requires only a single model pass and effectively captures local uncertainty patterns, improving detection accuracy.

Findings

01

LSC outperforms existing zero-shot baselines in multiple benchmarks.

02

LSC is robust under resource-constrained conditions.

03

LSC effectively detects factual inconsistencies in LLM outputs.

Abstract

Hallucinations in Large Language Models (LLMs), i.e., the tendency to generate plausible but non-factual content, pose a significant challenge for their reliable deployment in high-stakes environments. However, existing hallucination detection methods generally operate under unrealistic assumptions, i.e., either requiring expensive intensive sampling strategies for consistency checks or white-box LLM states, which are unavailable or inefficient in common API-based scenarios. To this end, we propose a novel efficient zero-shot metric called Lowest Span Confidence (LSC) for hallucination detection under minimal resource assumptions, only requiring a single forward with output probabilities. Concretely, LSC evaluates the joint likelihood of semantically coherent spans via a sliding window mechanism. By identifying regions of lowest marginal confidence across variable-length n-grams, LSC…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Advanced Graph Neural Networks