Pruning Multilingual Large Language Models for Multilingual Inference

Hwichan Kim; Jun Suzuki; Tosho Hirasawa; Mamoru Komachi

arXiv:2409.16911·cs.CL·October 3, 2024

Pruning Multilingual Large Language Models for Multilingual Inference

Hwichan Kim, Jun Suzuki, Tosho Hirasawa, Mamoru Komachi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper proposes a pruning method that leverages large magnitude features in multilingual language models to improve their zero-shot performance in non-English languages, addressing performance disparities.

Contribution

It introduces a novel pruning approach based on large magnitude features to enhance multilingual models' non-English inference capabilities.

Findings

01

Pruning based on large magnitude features improves non-English zero-shot performance.

02

The method retains translation-relevant features, boosting multilingual inference.

03

Empirical results show significant performance gains in non-English tasks.

Abstract

Multilingual large language models (MLLMs), trained on multilingual balanced data, demonstrate better zero-shot learning performance in non-English languages compared to large language models trained on English-dominant data. However, the disparity in performance between English and non-English languages remains a challenge yet to be fully addressed. A distinctive characteristic of MLLMs is their high-quality translation capabilities, indicating an acquired proficiency in aligning between languages. This study explores how to enhance the zero-shot performance of MLLMs in non-English languages by leveraging their alignment capability between English and non-English languages. To achieve this, we first analyze the behavior of MLLMs when performing translation and reveal that there are large magnitude features that play a critical role in the translation process. Inspired by these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hwichan0720/pruning_for_multilinguality
pytorchOfficial

Videos

Pruning Multilingual Large Language Models for Multilingual Inference· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsPruning