On Relation-Specific Neurons in Large Language Models
Yihong Liu, Runsheng Chen, Lea Hirlimann, Ahmad Dawar Hakimi, Mingyang Wang, Amir Hossein Kargaran, Sascha Rothe, Fran\c{c}ois Yvon, Hinrich Sch\"utze

TL;DR
This paper investigates whether certain neurons in large language models specifically encode relation information, revealing their properties and effects on factual recall and relation processing.
Contribution
It provides evidence for the existence of relation-specific neurons and characterizes their properties, such as cumulativity, versatility, and interference effects.
Findings
Relation-specific neurons exist in LLMs.
Deactivating relation neurons affects factual recall.
Neurons can transfer across relations and languages.
Abstract
In large language models (LLMs), certain \emph{neurons} can store distinct pieces of knowledge learned during pretraining. While factual knowledge typically appears as a combination of \emph{relations} and \emph{entities}, it remains unclear whether some neurons focus on a relation itself -- independent of any entity. We hypothesize such neurons \emph{detect} a relation in the input text and \emph{guide} generation involving such a relation. To investigate this, we study the LLama-2 family on a chosen set of relations, with a \textit{statistics}-based method. Our experiments demonstrate the existence of relation-specific neurons. We measure the effect of selectively deactivating candidate neurons specific to relation on the LLM's ability to handle (1) facts involving relation and (2) facts involving a different relation . With respect to their capacity for encoding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsSparse Evolutionary Training · Focus
