Bridging the Language Gaps in Large Language Models with Inference-Time   Cross-Lingual Intervention

Weixuan Wang; Minghao Wu; Barry Haddow; Alexandra Birch

arXiv:2410.12462·cs.CL·October 17, 2024

Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention

Weixuan Wang, Minghao Wu, Barry Haddow, Alexandra Birch

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces INCLINE, a cost-effective inference-time method that improves low-resource language performance in large language models by aligning their internal representations with high-resource languages without retraining.

Contribution

The paper presents INCLINE, a novel inference-time framework for cross-lingual alignment that enhances LLM performance on low-resource languages without additional training.

Findings

01

INCLINE significantly improves multilingual task performance.

02

The method is highly cost-effective and widely applicable.

03

Experimental results outperform recent baselines.

Abstract

Large Language Models (LLMs) have shown remarkable capabilities in natural language processing but exhibit significant performance gaps among different languages. Most existing approaches to address these disparities rely on pretraining or fine-tuning, which are resource-intensive. To overcome these limitations without incurring significant costs, we propose Inference-Time Cross-Lingual Intervention (INCLINE), a novel framework that enhances LLM performance on low-performing (source) languages by aligning their internal representations with those of high-performing (target) languages during inference. INCLINE initially learns alignment matrices using parallel sentences from source and target languages through a Least-Squares optimization, and then applies these matrices during inference to transform the low-performing language representations toward the high-performing language space.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

weixuan-wang123/INCLINE
pytorchOfficial

Videos

Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis