Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages

Jannik Brinkmann; Chris Wendler; Christian Bartelt; Aaron Mueller

arXiv:2501.06346 · cs.CL · May 26, 2025

TL;DR

This paper investigates how large language models encode morphosyntactic concepts across diverse languages, revealing shared representations and demonstrating their causal role in multilingual tasks like translation.

Contribution

It introduces a method to identify and manipulate shared grammatical features in LLMs, showing these features are robust and cross-lingually consistent.

Findings

01. Abstract grammatical concepts are often encoded in feature directions shared across many languages.

02. Ablating only the multilingual features reduces classifier performance to near chance across languages.

03. Modifying these features changes model behavior in a machine translation task.

Abstract

Human bilinguals often use similar brain regions to process multiple languages, depending on when they learned their second language and their proficiency. In large language models (LLMs), how are multiple languages learned and encoded? In this work, we explore the extent to which LLMs share representations of morphosyntactic concepts such as grammatical number, gender, and tense across languages. We train sparse autoencoders on Llama-3-8B and Aya-23-8B, and demonstrate that abstract grammatical concepts are often encoded in feature directions shared across many languages. We use causal interventions to verify the multilingual nature of these representations; specifically, we show that ablating only multilingual features decreases classifier performance to near-chance across languages. We then use these features to precisely modify model behavior in a machine translation task; this…
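
To make the ablation idea concrete, here is a minimal sketch of removing selected sparse-autoencoder features from a batch of activations. It is not the authors' implementation: the ReLU encoder, the random weights, the dimensions, and the helper names (`sae_encode`, `sae_decode`, `ablate_features`) are all assumptions chosen for illustration, and the summary above does not specify the paper's SAE architecture or intervention details.

```python
import numpy as np

def sae_encode(x, W_enc, b_enc):
    """Hypothetical ReLU encoder: activations (n, d_model) -> features (n, d_sae)."""
    return np.maximum(x @ W_enc + b_enc, 0.0)

def sae_decode(f, W_dec, b_dec):
    """Linear decoder: each row of W_dec is one feature direction in model space."""
    return f @ W_dec + b_dec

def ablate_features(x, feature_ids, W_enc, b_enc, W_dec, b_dec):
    """Zero the chosen features and rebuild the activation.

    The SAE reconstruction error is added back so that only the
    contribution of the ablated features is removed from x.
    """
    ids = np.asarray(feature_ids, dtype=int)
    f = sae_encode(x, W_enc, b_enc)
    error = x - sae_decode(f, W_dec, b_dec)  # part of x the SAE does not explain
    f_ablated = f.copy()
    f_ablated[:, ids] = 0.0
    return sae_decode(f_ablated, W_dec, b_dec) + error

# Toy setup with random weights (assumed sizes, not those of Llama-3-8B or Aya-23-8B).
rng = np.random.default_rng(0)
d_model, d_sae = 16, 64
W_enc = rng.normal(size=(d_model, d_sae)) / np.sqrt(d_model)
W_dec = rng.normal(size=(d_sae, d_model)) / np.sqrt(d_sae)
b_enc = np.zeros(d_sae)
b_dec = np.zeros(d_model)

x = rng.normal(size=(4, d_model))
ablated_ids = [3, 7]  # stand-ins for features judged to be multilingual
x_ablated = ablate_features(x, ablated_ids, W_enc, b_enc, W_dec, b_dec)
```

In an experiment of the kind the abstract describes, `x_ablated` would replace the original activations before a downstream classifier is evaluated; under this sketch, the difference `x - x_ablated` equals exactly the decoded contribution of the ablated features.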

Code & Models

Repositories

jannik-brinkmann/multilingual-features
PyTorch · Official

Videos

Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages · Underline