Do Multilingual LLMs have specialized language heads?

Muhammad Naufil

arXiv:2602.08625·cs.CL·February 10, 2026

Do Multilingual LLMs have specialized language heads?

Muhammad Naufil

PDF

Open Access

TL;DR

This paper investigates whether multilingual LLMs have language-specific attention heads and explores the potential for removing unnecessary heads to improve deployment efficiency without sacrificing performance.

Contribution

It is the first study to analyze language head specialization in multilingual LLMs and assess the impact of removing language-specific heads on model performance.

Findings

01

Multilingual LLMs exhibit some degree of language-specific attention heads.

02

Removing language-specific heads for certain languages can reduce model complexity.

03

Performance on targeted languages remains stable after removing irrelevant language heads.

Abstract

Multilingual large language models (LLMs) have gained significant popularity for their ability to process and generate text across multiple languages. However, deploying these models in production can be inefficient when only a subset of the supported languages is of interest. There has been some research conducted on identifying whether machine translation models have language-specific or language-agnostic heads, however no research has been conducted for multilingual LLMs, to the best of our knowledge, that as we know are capable of performing diverse tasks beyond just translation. This paper explores whether multilingual LLMs have specialized language attention heads for each language, and investigates the possibility of removing language-specific heads for unwanted languages without degrading performance in the targeted languages. Our findings could inform more efficient deployment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification