A New Benchmark and Reverse Validation Method for Passage-level   Hallucination Detection

Shiping Yang; Renliang Sun; Xiaojun Wan

arXiv:2310.06498·cs.CL·October 25, 2023

A New Benchmark and Reverse Validation Method for Passage-level Hallucination Detection

Shiping Yang, Renliang Sun, Xiaojun Wan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new benchmark called PHD and a reverse validation method for passage-level hallucination detection in LLMs, demonstrating improved accuracy and efficiency over existing zero-resource approaches.

Contribution

It presents a novel passage-level hallucination detection benchmark and a self-check method based on reverse validation, advancing zero-resource detection techniques.

Findings

01

The proposed method outperforms baselines in accuracy.

02

It requires fewer tokens and less time.

03

Shared limitations of zero-resource methods are identified.

Abstract

Large Language Models (LLMs) have shown their ability to collaborate effectively with humans in real-world scenarios. However, LLMs are apt to generate hallucinations, i.e., makeup incorrect text and unverified information, which can cause significant damage when deployed for mission-critical tasks. In this paper, we propose a self-check approach based on reverse validation to detect factual errors automatically in a zero-resource fashion. To facilitate future studies and assess different methods, we construct a hallucination detection benchmark named PHD, which is generated by ChatGPT and annotated by human annotators. Contrasting previous studies of zero-resource hallucination detection, our method and benchmark concentrate on passage-level detection instead of sentence-level. We empirically evaluate our method and existing zero-resource detection methods on two datasets. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maybenotime/phd
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health via Writing · Topic Modeling