Relational Linearity is a Predictor of Hallucinations
Yuetian Lu, Yihong Liu, and Hinrich Sch\"utze

TL;DR
This paper investigates how the linearity of relations in knowledge affects hallucination in large language models, revealing a strong correlation that informs future strategies to reduce hallucinations.
Contribution
It introduces SyntHal, a dataset for studying relation linearity and hallucinations, and demonstrates a significant correlation between relation linearity and hallucination rates in LLMs.
Findings
Strong correlation (r in [.78,.82]) between relation linearity and hallucination rate.
Medium-sized models frequently hallucinate on synthetic entities.
Linear relations are stored more abstractly, leading to higher hallucination susceptibility.
Abstract
Hallucination is a central failure mode in large language models (LLMs). We focus on hallucinations of answers to questions like: "Which instrument did Glenn Gould play?", but we ask these questions for synthetic entities that are unknown to the model. Surprisingly, we find that medium-size models like Gemma-7B-IT frequently hallucinate, i.e., they have difficulty recognizing that the hallucinated fact is not part of their knowledge. We hypothesize that an important factor in causing these hallucinations is the linearity of the relation: linear relations tend to be stored more abstractly, making it difficult for the LLM to assess its knowledge; the facts of nonlinear relations tend to be stored more directly, making knowledge assessment easier. To investigate this hypothesis, we create SyntHal, a dataset of 6000 synthetic entities for six relations. In our experiments with four models,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Natural Language Processing Techniques
