Logical Consistency as a Bridge: Improving LLM Hallucination Detection via Label Constraint Modeling between Responses and Self-Judgments

Hao Mi; Qiang Sheng; Shaofei Wang; Beizhe Hu; Yifan Sun; Zhengjia Wang; Hengqi Zeng; Yang Li; Danding Wang; Juan Cao

arXiv:2605.03971·cs.CL·May 6, 2026

Logical Consistency as a Bridge: Improving LLM Hallucination Detection via Label Constraint Modeling between Responses and Self-Judgments

Hao Mi, Qiang Sheng, Shaofei Wang, Beizhe Hu, Yifan Sun, Zhengjia Wang, Hengqi Zeng, Yang Li, Danding Wang, Juan Cao

PDF

TL;DR

LaaB introduces a novel framework that enhances hallucination detection in LLMs by integrating neural features and symbolic judgments through logical consistency, improving reliability across multiple datasets and models.

Contribution

It proposes a meta-judgment process that maps symbolic labels into feature space, exploiting the logical relationship between responses and self-judgments for better hallucination detection.

Findings

01

LaaB outperforms 8 baselines on 4 datasets and models.

02

The framework effectively leverages logical consistency to improve detection accuracy.

03

Extensive experiments validate the superiority of LaaB across diverse settings.

Abstract

Large Language Models (LLMs) are prone to factual hallucinations, risking their reliability in real-world applications. Existing hallucination detectors mainly extract micro-level intrinsic patterns for uncertainty quantification or elicit macro-level self-judgments through verbalized prompts. However, these methods address only a single facet of the hallucination, focusing either on implicit neural uncertainty or explicit symbolic reasoning, thereby treating these inherently coupled behaviors in isolation and failing to exploit their interdependence for a holistic view. In this paper, we propose LaaB (Logical Consistency-as-a-Bridge), a framework that bridges neural features and symbolic judgments for hallucination detection. LaaB introduces a "meta-judgment" process to map symbolic labels back into the feature space. By leveraging the inherent logical bridge where response and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.