Low-Dimensional Structure in the Space of Language Representations is Reflected in Brain Responses

Richard Antonello; Javier Turek; Vy Vo; and Alexander Huth

arXiv:2106.05426·cs.CL·December 11, 2025·24 cites

Low-Dimensional Structure in the Space of Language Representations is Reflected in Brain Responses

Richard Antonello, Javier Turek, Vy Vo, and Alexander Huth

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper uncovers a low-dimensional structure in language model representations that correlates with brain responses, revealing a shared underlying organization of language processing in models and the brain.

Contribution

It introduces a transfer learning method to map and analyze the structure of language representations across models and brain responses, revealing a shared low-dimensional embedding.

Findings

01

Language models and translation models interpolate between linguistic features.

02

The embedding predicts brain responses to language stimuli.

03

The principal dimension reflects the brain's language hierarchy.

Abstract

How related are the representations learned by neural language models, translation models, and language tagging tasks? We answer this question by adapting an encoder-decoder transfer learning method from computer vision to investigate the structure among 100 different feature spaces extracted from hidden representations of various networks trained on language tasks. This method reveals a low-dimensional structure where language models and translation models smoothly interpolate between word embeddings, syntactic and semantic tasks, and future word embeddings. We call this low-dimensional structure a language representation embedding because it encodes the relationships between representations needed to process language for a variety of NLP tasks. We find that this representation embedding can predict how well each individual feature space maps to human brain responses to natural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

huthlab/rep_structure
noneOfficial

Videos

Low-dimensional Structure in the Space of Language Representations is Reflected in Brain Responses· slideslive

Taxonomy

TopicsTopic Modeling · Neurobiology of Language and Bilingualism · Natural Language Processing Techniques