A Multiple-Fill-in-the-Blank Exam Approach for Enhancing Zero-Resource   Hallucination Detection in Large Language Models

Satoshi Munakata; Taku Fukui; Takao Mohri

arXiv:2409.17173·cs.CL·September 27, 2024

A Multiple-Fill-in-the-Blank Exam Approach for Enhancing Zero-Resource Hallucination Detection in Large Language Models

Satoshi Munakata, Taku Fukui, Takao Mohri

PDF

Open Access

TL;DR

This paper introduces a novel multiple-fill-in-the-blank exam approach to improve zero-resource hallucination detection in large language models, ensuring storyline consistency and outperforming existing methods.

Contribution

It proposes a new hallucination detection method that maintains storyline alignment by masking objects and repeatedly prompting LLMs, addressing the issue of story changes in regenerated texts.

Findings

01

Outperforms existing hallucination detection methods.

02

Achieves state-of-the-art performance in ensemble settings.

03

Effectively maintains storyline consistency during detection.

Abstract

Large language models (LLMs) often fabricate a hallucinatory text. Several methods have been developed to detect such text by semantically comparing it with the multiple versions probabilistically regenerated. However, a significant issue is that if the storyline of each regenerated text changes, the generated texts become incomparable, which worsen detection accuracy. In this paper, we propose a hallucination detection method that incorporates a multiple-fill-in-the-blank exam approach to address this storyline-changing issue. First, our method creates a multiple-fill-in-the-blank exam by masking multiple objects from the original text. Second, prompts an LLM to repeatedly answer this exam. This approach ensures that the storylines of the exam answers align with the original ones. Finally, quantifies the degree of hallucination for each original sentence by scoring the exam answers,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBrain Tumor Detection and Classification · Topic Modeling · Text Readability and Simplification

MethodsALIGN