CoCo-CoLa: Evaluating and Improving Language Adherence in Multilingual LLMs
Elnaz Rahmati, Alireza S. Ziabari, Morteza Dehghani

TL;DR
This paper introduces CoCo-CoLa, a new metric for evaluating language adherence in multilingual LLMs, and proposes a partial fine-tuning method that improves language accuracy efficiently across multiple languages.
Contribution
The paper presents CoCo-CoLa for assessing language adherence and a novel partial fine-tuning approach targeting language-specific layers to enhance multilingual LLM performance.
Findings
Multilingual models share task knowledge but show language bias.
Final layers are crucial for output language determination.
Partial fine-tuning improves language adherence with less computation.
Abstract
Multilingual Large Language Models (LLMs) develop cross-lingual abilities despite being trained on limited parallel data. However, they often struggle to generate responses in the intended language, favoring high-resource languages such as English. In this work, we introduce CoCo-CoLa (Correct Concept - Correct Language), a novel metric to evaluate language adherence in multilingual LLMs. Using fine-tuning experiments on a closed-book QA task across seven languages, we analyze how training in one language affects others' performance. Our findings reveal that multilingual models share task knowledge across languages but exhibit biases in the selection of output language. We identify language-specific layers, showing that final layers play a crucial role in determining output language. Accordingly, we propose a partial training strategy that selectively fine-tunes key layers, improving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · linguistics and terminology studies · Text Readability and Simplification
