Franken-Adapter: Cross-Lingual Adaptation of LLMs by Embedding Surgery
Fan Jiang, Honglin Yu, Grace Chung, Trevor Cohn

TL;DR
Franken-Adapter is a modular method for adapting large language models to low-resource languages through embedding surgery, improving multilingual performance with minimal English regression.
Contribution
It introduces a novel embedding surgery technique for cross-lingual adaptation of decoder-only LLMs, enhancing multilingual capabilities post-training.
Findings
Up to 20% performance improvement across 96 languages.
Minimal regressions (<1%) in English performance.
Versatile application to math-optimized LLMs with 14% gains.
Abstract
The capabilities of Large Language Models (LLMs) in low-resource languages lag far behind those in English, making their universal accessibility a significant challenge. To alleviate this, we present , a modular language adaptation approach for decoder-only LLMs with embedding surgery. Our method begins by creating customized vocabularies for target languages and performing language adaptation through embedding tuning on multilingual data. These pre-trained embeddings are subsequently integrated with LLMs that have been instruction-tuned on English alignment data to enable zero-shot cross-lingual transfer. Our experiments on models with up to 27B parameters demonstrate improvements of up to 20% across 96 languages, spanning both discriminative and generative tasks, with minimal regressions (1%) in English. Further in-depth analysis reveals…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Interpreting and Communication in Healthcare · Translation Studies and Practices
