A Multifaceted Analysis of Negative Bias in Large Language Models through the Lens of Parametric Knowledge
Jongyoon Song, Sangwon Yu, Sungroh Yoon

TL;DR
This paper investigates the causes of negative bias in large language models, revealing that prompt format and knowledge sufficiency significantly influence biased responses, and explores methods to mitigate this bias.
Contribution
It introduces a systematic evaluation pipeline based on parametric knowledge and analyzes how prompt design affects negative bias in LLMs, offering new insights for bias mitigation.
Findings
Negative bias is influenced more by prompt format than semantics.
Lack of sufficient knowledge leads to increased negative responses.
Providing relevant context and 'I don't know' options reduces negative bias.
Abstract
Negative bias refers to the tendency of large language models (LLMs) to excessively generate negative responses in binary decision tasks (e.g., yes-no question answering). Previous research has focused on detecting and addressing negative attention heads that induce negative bias. However, the underlying detailed factors influencing negative bias remain underexplored. In this paper, we demonstrate that LLMs exhibit format-level negative bias, meaning the prompt format more influences their responses than the semantics of the negative response. For the fine-grained study of the negative bias, we introduce a pipeline for constructing the evaluation set, which systematically categorizes the dataset into three subsets based on the model's parametric knowledge: correct, incorrect, and insufficient relevant knowledge. Through analysis of this evaluation set, we identify a shortcut behavior in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Artificial Intelligence in Healthcare and Education
