An Empirical Study on Cross-lingual Vocabulary Adaptation for Efficient Language Model Inference
Atsuki Yamaguchi, Aline Villavicencio, Nikolaos Aletras

TL;DR
This study empirically evaluates five cross-lingual vocabulary adaptation methods, demonstrating significant inference speed improvements and comparable downstream performance across diverse languages and models.
Contribution
It provides the first comprehensive empirical analysis of CVA methods' impact on inference efficiency and performance in generative LLMs.
Findings
CVA methods can speed up inference by up to 271.5%.
Adapting models with balanced multilingual data maintains performance.
CVA improves inference efficiency across diverse languages and models.
Abstract
The development of state-of-the-art generative large language models (LLMs) disproportionately relies on English-centric tokenizers, vocabulary and pre-training data. Despite the fact that some LLMs have multilingual capabilities, recent studies have shown that their inference efficiency deteriorates when generating text in languages other than English. This results in increased inference time and costs. Cross-lingual vocabulary adaptation (CVA) methods have been proposed for adapting models to a target language aiming to improve downstream performance. However, the effectiveness of these methods on increasing inference efficiency of generative LLMs has yet to be explored. In this paper, we perform an empirical study of five CVA methods on four generative LLMs (including monolingual and multilingual models) across four typologically-diverse languages and four natural language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗atsuki-yamaguchi/bloom-1b1-lapt-demodel
- 🤗atsuki-yamaguchi/bloom-1b1-lapt-jamodel
- 🤗atsuki-yamaguchi/bloom-1b1-lapt-armodel
- 🤗atsuki-yamaguchi/bloom-1b1-lapt-swmodel
- 🤗atsuki-yamaguchi/bloom-1b1-clp-jamodel· 1 dl1 dl
- 🤗atsuki-yamaguchi/bloom-1b1-random-demodel· 1 dl1 dl
- 🤗atsuki-yamaguchi/bloom-1b1-random-jamodel· 1 dl1 dl
- 🤗atsuki-yamaguchi/bloom-7b1-lapt-armodel
- 🤗atsuki-yamaguchi/bloom-7b1-lapt-demodel
- 🤗atsuki-yamaguchi/bloom-7b1-lapt-jamodel
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
