The LLM Language Network: A Neuroscientific Approach for Identifying   Causally Task-Relevant Units

Badr AlKhamissi; Greta Tuckute; Antoine Bosselut; Martin Schrimpf

arXiv:2411.02280·cs.CL·February 14, 2025

The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant Units

Badr AlKhamissi, Greta Tuckute, Antoine Bosselut, Martin Schrimpf

PDF

Open Access 1 Repo 2 Videos

TL;DR

This paper investigates whether large language models (LLMs) develop specialized, causally important units for language processing, similar to the human brain, and explores their domain-specific specialization and alignment with brain activity.

Contribution

The study identifies language-selective units in 18 LLMs, demonstrates their causal importance for language tasks, and compares their specialization across different cognitive domains.

Findings

01

Language-selective units are causally critical for language tasks.

02

These units are more aligned with human brain activity than random units.

03

Some models show specialization for reasoning and social tasks, with notable differences among models.

Abstract

Large language models (LLMs) exhibit remarkable capabilities on not just language tasks, but also various tasks that are not linguistic in nature, such as logical reasoning and social inference. In the human brain, neuroscience has identified a core language system that selectively and causally supports language processing. We here ask whether similar specialization for language emerges in LLMs. We identify language-selective units within 18 popular LLMs, using the same localization approach that is used in neuroscience. We then establish the causal role of these units by demonstrating that ablating LLM language-selective units -- but not random units -- leads to drastic deficits in language tasks. Correspondingly, language-selective LLM units are more aligned to brain recordings from the human language system than random units. Finally, we investigate whether our localization method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bkhmsi/llm-localization
pytorchOfficial

Videos

The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant Units· underline

Taxonomy

TopicsTopic Modeling