Large Language Models are Skeptics: False Negative Problem of Input-conflicting Hallucination
Jongyoon Song, Sangwon Yu, Sungroh Yoon

TL;DR
This paper uncovers a bias in large language models causing false negatives in input-conflicting scenarios, leading to overconfidence in incorrect responses, and explores methods to mitigate this issue.
Contribution
It identifies the false negative problem in LLMs related to input-conflicting hallucinations and analyzes how context and rewriting strategies can reduce this bias.
Findings
LLMs tend to produce false negatives when input statements conflict.
Context and query rewriting can effectively reduce false negatives.
Models show overconfidence in incorrect responses under conflicting inputs.
Abstract
In this paper, we identify a new category of bias that induces input-conflicting hallucinations, where large language models (LLMs) generate responses inconsistent with the content of the input context. This issue we have termed the false negative problem refers to the phenomenon where LLMs are predisposed to return negative judgments when assessing the correctness of a statement given the context. In experiments involving pairs of statements that contain the same information but have contradictory factual directions, we observe that LLMs exhibit a bias toward false negatives. Specifically, the model presents greater overconfidence when responding with False. Furthermore, we analyze the relationship between the false negative problem and context and query rewriting and observe that both effectively tackle false negatives in LLMs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis
