Large Language Models are Skeptics: False Negative Problem of   Input-conflicting Hallucination

Jongyoon Song; Sangwon Yu; Sungroh Yoon

arXiv:2406.13929·cs.CL·June 21, 2024·1 cites

Large Language Models are Skeptics: False Negative Problem of Input-conflicting Hallucination

Jongyoon Song, Sangwon Yu, Sungroh Yoon

PDF

Open Access

TL;DR

This paper uncovers a bias in large language models causing false negatives in input-conflicting scenarios, leading to overconfidence in incorrect responses, and explores methods to mitigate this issue.

Contribution

It identifies the false negative problem in LLMs related to input-conflicting hallucinations and analyzes how context and rewriting strategies can reduce this bias.

Findings

01

LLMs tend to produce false negatives when input statements conflict.

02

Context and query rewriting can effectively reduce false negatives.

03

Models show overconfidence in incorrect responses under conflicting inputs.

Abstract

In this paper, we identify a new category of bias that induces input-conflicting hallucinations, where large language models (LLMs) generate responses inconsistent with the content of the input context. This issue we have termed the false negative problem refers to the phenomenon where LLMs are predisposed to return negative judgments when assessing the correctness of a statement given the context. In experiments involving pairs of statements that contain the same information but have contradictory factual directions, we observe that LLMs exhibit a bias toward false negatives. Specifically, the model presents greater overconfidence when responding with False. Furthermore, we analyze the relationship between the false negative problem and context and query rewriting and observe that both effectively tackle false negatives in LLMs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopological and Geometric Data Analysis