Franken-Adapter: Cross-Lingual Adaptation of LLMs by Embedding Surgery

Fan Jiang; Honglin Yu; Grace Chung; Trevor Cohn

arXiv:2502.08037·cs.CL·February 13, 2025

Franken-Adapter: Cross-Lingual Adaptation of LLMs by Embedding Surgery

Fan Jiang, Honglin Yu, Grace Chung, Trevor Cohn

PDF

Open Access

TL;DR

Franken-Adapter is a modular method for adapting large language models to low-resource languages through embedding surgery, improving multilingual performance with minimal English regression.

Contribution

It introduces a novel embedding surgery technique for cross-lingual adaptation of decoder-only LLMs, enhancing multilingual capabilities post-training.

Findings

01

Up to 20% performance improvement across 96 languages.

02

Minimal regressions (<1%) in English performance.

03

Versatile application to math-optimized LLMs with 14% gains.

Abstract

The capabilities of Large Language Models (LLMs) in low-resource languages lag far behind those in English, making their universal accessibility a significant challenge. To alleviate this, we present $Franken-Adapter$ , a modular language adaptation approach for decoder-only LLMs with embedding surgery. Our method begins by creating customized vocabularies for target languages and performing language adaptation through embedding tuning on multilingual data. These pre-trained embeddings are subsequently integrated with LLMs that have been instruction-tuned on English alignment data to enable zero-shot cross-lingual transfer. Our experiments on $Gemma2$ models with up to 27B parameters demonstrate improvements of up to 20% across 96 languages, spanning both discriminative and generative tasks, with minimal regressions ( $<$ 1%) in English. Further in-depth analysis reveals…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Interpreting and Communication in Healthcare · Translation Studies and Practices