Linear Correlation in LM's Compositional Generalization and Hallucination
Letian Peng, Chenyang An, Shibo Hao, Chengyu Dong, Jingbo Shang

TL;DR
This paper reveals linear correlations in language models during knowledge composition, showing how these transformations relate to generalization and hallucination, and proposes methods to identify and learn these correlations.
Contribution
It uncovers the existence of linear transformations in LMs for knowledge composition and demonstrates their role in generalization and hallucination, offering new insights into LM behavior.
Findings
Linear transformations exist between related knowledge prompts in LMs.
Linear correlation can predict LM generalization and hallucination.
Linear correlations can be learned with a simple feedforward network.
Abstract
The generalization of language models (LMs) is undergoing active debates, contrasting their potential for general intelligence with their struggles with basic knowledge composition (e.g., reverse/transition curse). This paper uncovers the phenomenon of linear correlations in LMs during knowledge composition. For explanation, there exists a linear transformation between certain related knowledge that maps the next token prediction logits from one prompt to another, e.g., "X lives in the city of" "X lives in the country of" for every given X. This mirrors the linearity in human knowledge composition, such as Paris France. Our findings indicate that the linear transformation is resilient to large-scale fine-tuning, generalizing updated knowledge when aligned with real-world relationships, but causing hallucinations when it deviates. Empirical results suggest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHallucinations in medical conditions · Geochemistry and Geologic Mapping
MethodsDense Connections · Feedforward Network
