Hallucination Detection with Small Language Models

Ming Cheung

arXiv:2506.22486·cs.CL·July 1, 2025

Hallucination Detection with Small Language Models

Ming Cheung

PDF

Open Access

TL;DR

This paper presents a scalable framework using multiple small language models to detect hallucinations in large language model responses by verifying answer consistency with retrieved context, improving detection accuracy.

Contribution

It introduces a novel approach combining small language models for hallucination detection, enhancing reliability without ground truth in question-answering tasks.

Findings

01

10% improvement in F1 scores for hallucination detection

02

Effective verification of partially correct responses

03

Scalable and efficient for practical applications

Abstract

Since the introduction of ChatGPT, large language models (LLMs) have demonstrated significant utility in various tasks, such as answering questions through retrieval-augmented generation. Context can be retrieved using a vectorized database, serving as a foundation for LLMs to generate responses. However, hallucinations in responses can undermine the reliability of LLMs in practical applications, and they are not easily detectable in the absence of ground truth, particularly in question-and-answer scenarios. This paper proposes a framework that integrates multiple small language models to verify responses generated by LLMs using the retrieved context from a vectorized database. By breaking down the responses into individual sentences and utilizing the probability of generating "Yes" tokens from the outputs of multiple models for a given set of questions, responses, and relevant context,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Misinformation and Its Impacts · Advanced Graph Neural Networks