Unveiling Linguistic Regions in Large Language Models
Zhihao Zhang, Jun Zhao, Qi Zhang, Tao Gui, Xuanjing Huang

TL;DR
This paper investigates the intrinsic linguistic regions within large language models, revealing a core region vital for multilingual competence and demonstrating how perturbations affect language performance and how freezing this core can mitigate catastrophic forgetting.
Contribution
The study identifies a core linguistic region in LLMs, explores its properties, and shows how manipulating this region impacts multilingual performance and continual learning.
Findings
A core region accounts for ~1% of parameters and is crucial for linguistic competence.
Perturbing specific parameters causes significant performance drops across languages.
Freezing the core region during further training reduces catastrophic forgetting.
Abstract
Large Language Models (LLMs) have demonstrated considerable cross-lingual alignment and generalization ability. Current research primarily focuses on improving LLMs' cross-lingual generalization capabilities. However, there is still a lack of research on the intrinsic mechanisms of how LLMs achieve cross-lingual alignment. From the perspective of region partitioning, this paper conducts several investigations on the linguistic competence of LLMs. We discover a core region in LLMs that corresponds to linguistic competence, accounting for approximately 1% of the total model parameters. Removing this core region by setting parameters to zero results in a significant performance decrease across 30 different languages. Furthermore, this core region exhibits significant dimensional dependence, perturbations to even a single parameter on specific dimensions leading to a loss of linguistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
