Hallucination Detection with Small Language Models
Ming Cheung

TL;DR
This paper presents a scalable framework using multiple small language models to detect hallucinations in large language model responses by verifying answer consistency with retrieved context, improving detection accuracy.
Contribution
It introduces a novel approach combining small language models for hallucination detection, enhancing reliability without ground truth in question-answering tasks.
Findings
10% improvement in F1 scores for hallucination detection
Effective verification of partially correct responses
Scalable and efficient for practical applications
Abstract
Since the introduction of ChatGPT, large language models (LLMs) have demonstrated significant utility in various tasks, such as answering questions through retrieval-augmented generation. Context can be retrieved using a vectorized database, serving as a foundation for LLMs to generate responses. However, hallucinations in responses can undermine the reliability of LLMs in practical applications, and they are not easily detectable in the absence of ground truth, particularly in question-and-answer scenarios. This paper proposes a framework that integrates multiple small language models to verify responses generated by LLMs using the retrieved context from a vectorized database. By breaking down the responses into individual sentences and utilizing the probability of generating "Yes" tokens from the outputs of multiple models for a given set of questions, responses, and relevant context,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Misinformation and Its Impacts · Advanced Graph Neural Networks
