COMPASS: COntinual Multilingual PEFT with Adaptive Semantic Sampling

Noah Flynn

arXiv:2604.20720·cs.LG·April 23, 2026

COMPASS: COntinual Multilingual PEFT with Adaptive Semantic Sampling

Noah Flynn

PDF

1 Repo

TL;DR

COMPASS is a data-centric framework that improves multilingual LLM performance by adaptive semantic sampling and continual adapter updates, addressing cross-lingual interference and data distribution shifts.

Contribution

It introduces a novel semantic sampling strategy and continual learning framework for multilingual LLM fine-tuning, enhancing transfer and robustness across languages.

Findings

01

Outperforms baseline methods on multilingual benchmarks.

02

Effectively handles unseen long-context tasks.

03

Maintains high performance in dynamic, real-world environments.

Abstract

Large language models (LLMs) often exhibit performance disparities across languages, with naive multilingual fine-tuning frequently degrading performance due to negative cross-lingual interference. To address this, we introduce COMPASS (COntinual Multilingual PEFT with Adaptive Semantic Sampling), a novel data-centric framework for adapting LLMs to target languages. COMPASS leverages parameter-efficient fine-tuning (PEFT) by training lightweight, language-specific adapters on a judiciously selected subset of auxiliary multilingual data. The core of our method is a distribution-aware sampling strategy that uses multilingual embeddings and clustering to identify semantic gaps between existing training data and a target usage distribution. By prioritizing auxiliary data from under-represented semantic clusters, COMPASS maximizes positive cross-lingual transfer while minimizing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

arxiv-to-code/arxiv-260420720-compass-continual-multilingual-peft-with-adaptive-semantic-s
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.