Drowzee: Metamorphic Testing for Fact-Conflicting Hallucination Detection in Large Language Models

Ningke Li; Yuekang Li; Yi Liu; Ling Shi; Kailong Wang; Haoyu Wang

arXiv:2405.00648 · cs.SE · September 4, 2024 · 1 citation

Open Access

TL;DR

This paper introduces Drowzee, a logic-based metamorphic testing approach to detect fact-conflicting hallucinations in large language models, addressing dataset creation and reasoning validation challenges.

Contribution

It presents a novel logic programming method for generating diverse test cases and validating LLM outputs to effectively identify hallucinations.
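To make the idea of logic-derived test cases concrete, here is a minimal sketch in Python. It is not the paper's implementation: the seed facts, the `located_in` relation, the single transitivity rule, and the helper names (`derive_transitive`, `make_test_cases`, `is_hallucination`) are all assumptions chosen for illustration. The sketch derives new facts from seed triples with one logical rule, turns each derived fact into a yes/no question with a known expected answer, and flags a model response that contradicts it.

```python
from itertools import product

# Hypothetical seed facts as (subject, relation, object) triples; illustrative only.
FACTS = {
    ("Paris", "located_in", "France"),
    ("France", "located_in", "Europe"),
    ("Lyon", "located_in", "France"),
}

# Relations assumed to be transitive for this sketch.
TRANSITIVE = {"located_in"}


def derive_transitive(facts):
    """Apply r(a, b) and r(b, c) => r(a, c) once; return only the new facts."""
    derived = set()
    for (a, r1, b), (b2, r2, c) in product(facts, repeat=2):
        if r1 == r2 and r1 in TRANSITIVE and b == b2 and a != c:
            derived.add((a, r1, c))
    return derived - facts


def make_test_cases(facts):
    """Turn each derived fact into a yes/no question with a known expected answer."""
    cases = []
    for s, r, o in sorted(derive_transitive(facts)):
        question = f"Is it true that {s} is {r.replace('_', ' ')} {o}? Answer yes or no."
        cases.append({"question": question, "expected": "yes", "fact": (s, r, o)})
    return cases


def is_hallucination(case, llm_answer):
    """Flag a response whose leading yes/no contradicts the derived ground truth."""
    normalized = llm_answer.strip().lower()
    return not normalized.startswith(case["expected"])


cases = make_test_cases(FACTS)
```

In this sketch the expected answer comes from the rule rather than from a hand-labeled benchmark, so adding seed facts or rules produces new test cases automatically. Checking only the leading yes/no is a deliberate simplification; it does not validate the model's reasoning, which the abstract identifies as a separate challenge.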

Findings

01. Hallucination rates ranged from 24.7% to 59.8% across models.

02. LLMs struggle with temporal concepts and out-of-distribution knowledge.

03. Logic-based test cases effectively trigger and detect hallucinations.

Abstract

Large language models (LLMs) have transformed the landscape of language processing, yet struggle with significant challenges in terms of security, privacy, and the generation of seemingly coherent but factually inaccurate outputs, commonly referred to as hallucinations. Among these challenges, one particularly pressing issue is Fact-Conflicting Hallucination (FCH), where LLMs generate content that directly contradicts established facts. Tackling FCH poses a formidable task due to two primary obstacles: Firstly, automating the construction and updating of benchmark datasets is challenging, as current methods rely on static benchmarks that don't cover the diverse range of FCH scenarios. Secondly, validating LLM outputs' reasoning process is inherently complex, especially with intricate logical relations involved. In addressing these obstacles, we propose an innovative approach…

Taxonomy

Topics: Pharmacovigilance and Adverse Drug Reactions