Unveiling Linguistic Regions in Large Language Models

Zhihao Zhang; Jun Zhao; Qi Zhang; Tao Gui; Xuanjing Huang

arXiv:2402.14700·cs.CL·May 31, 2024·1 cites

Unveiling Linguistic Regions in Large Language Models

Zhihao Zhang, Jun Zhao, Qi Zhang, Tao Gui, Xuanjing Huang

PDF

Open Access 1 Repo

TL;DR

This paper investigates the intrinsic linguistic regions within large language models, revealing a core region vital for multilingual competence and demonstrating how perturbations affect language performance and how freezing this core can mitigate catastrophic forgetting.

Contribution

The study identifies a core linguistic region in LLMs, explores its properties, and shows how manipulating this region impacts multilingual performance and continual learning.

Findings

01

A core region accounts for ~1% of parameters and is crucial for linguistic competence.

02

Perturbing specific parameters causes significant performance drops across languages.

03

Freezing the core region during further training reduces catastrophic forgetting.

Abstract

Large Language Models (LLMs) have demonstrated considerable cross-lingual alignment and generalization ability. Current research primarily focuses on improving LLMs' cross-lingual generalization capabilities. However, there is still a lack of research on the intrinsic mechanisms of how LLMs achieve cross-lingual alignment. From the perspective of region partitioning, this paper conducts several investigations on the linguistic competence of LLMs. We discover a core region in LLMs that corresponds to linguistic competence, accounting for approximately 1% of the total model parameters. Removing this core region by setting parameters to zero results in a significant performance decrease across 30 different languages. Furthermore, this core region exhibits significant dimensional dependence, perturbations to even a single parameter on specific dimensions leading to a loss of linguistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zzhang0179/Unveiling-Linguistic-Regions-in-LLMs
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling