TL;DR
This paper investigates language-specific neurons in large multilingual models, identifying their properties and demonstrating how systematic neuron manipulation can control language behavior and improve multilingual task performance.
Contribution
It introduces the LAPE method for identifying language neurons and demonstrates systematic neuron activation manipulation to steer model behavior across multiple languages and tasks.
Findings
Neurons cluster in deeper layers and are more specialized for non-Latin scripts.
Shared neurons reflect linguistic proximity among related languages.
Neuron manipulation improves multilingual task performance and language control.
Abstract
Large language models (LLMs) exhibit strong multilingual abilities, yet the neural mechanisms behind language-specific processing remain unclear. We analyze language-specific neurons in Llama-3.1-8B, Mistral-Nemo-12B, and Aya-Expanse-8B & 32B across 21 typologically diverse languages, identifying neurons that control language behavior. Using the Language Activation Probability Entropy (LAPE) method, we show that these neurons cluster in deeper layers, with non-Latin scripts showing greater specialization. Related languages share overlapping neurons, reflecting internal representations of linguistic proximity. Through language arithmetics, i.e. systematic activation addition and multiplication, we steer models to deactivate unwanted languages and activate desired ones, outperforming simpler replacement approaches. These interventions effectively guide behavior across five multilingual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗DGurgurov/llama-3.1-8b-mlt_latnmodel
- 🤗DGurgurov/llama-3.1-8b-afr_latnmodel· 1 dl1 dl
- 🤗DGurgurov/llama-3.1-8b-isl_latnmodel
- 🤗DGurgurov/llama-3.1-8b-cym_latnmodel· 1 dl1 dl
- 🤗DGurgurov/llama-3.1-8b-mkd_cyrlmodel· 1 dl1 dl
- 🤗DGurgurov/llama-3.1-8b-lvs_latnmodel
- 🤗DGurgurov/llama-3.1-8b-lit_latnmodel· 4 dl· ♡ 14 dl♡ 1
- 🤗DGurgurov/llama-3.1-8b-slv_latnmodel
- 🤗DGurgurov/llama-3.1-8b-slk_latnmodel· 2 dl2 dl
- 🤗DGurgurov/llama-3.1-8b-ekk_latnmodel· 1 dl· ♡ 11 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
