SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Potsawee Manakul, Adian Liusie, Mark J. F. Gales

TL;DR
SelfCheckGPT is a zero-resource, sampling-based method for detecting hallucinations in black-box large language models by analyzing the consistency of generated responses without external data.
Contribution
It introduces a simple, effective approach for fact-checking black-box LLM outputs without external databases, leveraging response consistency to identify hallucinations.
Findings
High accuracy in sentence-level hallucination detection
Effective passage ranking based on factuality
Outperforms existing grey-box methods in key metrics
Abstract
Generative Large Language Models (LLMs) such as GPT-3 are capable of generating highly fluent responses to a wide variety of user prompts. However, LLMs are known to hallucinate facts and make non-factual statements which can undermine trust in their output. Existing fact-checking approaches either require access to the output probability distribution (which may not be available for systems such as ChatGPT) or external databases that are interfaced via separate, often complex, modules. In this work, we propose "SelfCheckGPT", a simple sampling-based approach that can be used to fact-check the responses of black-box models in a zero-resource fashion, i.e. without an external database. SelfCheckGPT leverages the simple idea that if an LLM has knowledge of a given concept, sampled responses are likely to be similar and contain consistent facts. However, for hallucinated facts,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text Readability and Simplification · Machine Learning in Healthcare
Methods{Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · 15 Ways to Contact How can i speak to someone at Delta Airlines · Weight Decay · Dropout · Cosine Annealing · Layer Normalization · Refunds@Expedia|||How do I get a full refund from Expedia?
