TL;DR
COMPASS is a data-centric framework that improves multilingual LLM performance by adaptive semantic sampling and continual adapter updates, addressing cross-lingual interference and data distribution shifts.
Contribution
It introduces a novel semantic sampling strategy and continual learning framework for multilingual LLM fine-tuning, enhancing transfer and robustness across languages.
Findings
Outperforms baseline methods on multilingual benchmarks.
Effectively handles unseen long-context tasks.
Maintains high performance in dynamic, real-world environments.
Abstract
Large language models (LLMs) often exhibit performance disparities across languages, with naive multilingual fine-tuning frequently degrading performance due to negative cross-lingual interference. To address this, we introduce COMPASS (COntinual Multilingual PEFT with Adaptive Semantic Sampling), a novel data-centric framework for adapting LLMs to target languages. COMPASS leverages parameter-efficient fine-tuning (PEFT) by training lightweight, language-specific adapters on a judiciously selected subset of auxiliary multilingual data. The core of our method is a distribution-aware sampling strategy that uses multilingual embeddings and clustering to identify semantic gaps between existing training data and a target usage distribution. By prioritizing auxiliary data from under-represented semantic clusters, COMPASS maximizes positive cross-lingual transfer while minimizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
