JointCQ: Improving Factual Hallucination Detection with Joint Claim and Query Generation
Fan Xu, Huixuan Zhang, Zhenliang Zhang, Jiahao Wang, Xiaojun Wan

TL;DR
JointCQ is a novel framework that enhances factual hallucination detection in large language models by jointly generating claims and queries, leading to improved accuracy and reliability in identifying hallucinated content.
Contribution
The paper introduces a joint claim and query generation framework with a filtering strategy, significantly improving hallucination detection performance over existing methods.
Findings
Outperforms previous methods on multiple benchmarks
Enhances the reliability of hallucination detection
Provides more trustworthy language model outputs
Abstract
Current large language models (LLMs) often suffer from hallucination issues, i,e, generating content that appears factual but is actually unreliable. A typical hallucination detection pipeline involves response decomposition (i.e., claim extraction), query generation, evidence collection (i.e., search or retrieval), and claim verification. However, existing methods exhibit limitations in the first two stages, such as context loss during claim extraction and low specificity in query generation, resulting in degraded performance across the hallucination detection pipeline. In this work, we introduce JointCQ https://github.com/pku0xff/JointCQ, a joint claim-and-query generation framework designed to construct an effective and efficient claim-query generator. Our framework leverages elaborately designed evaluation criteria to filter synthesized training data, and finetunes a language model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Mobile Crowdsensing and Crowdsourcing
