Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages
Daniil Gurgurov, Ivan Vykopal, Josef van Genabith, Simon Ostermann

TL;DR
This paper explores efficient adapter-based methods for adapting small multilingual models to low-resource languages, demonstrating that small models with structured adaptation outperform larger models in low-data scenarios.
Contribution
It systematically evaluates parameter-efficient adapters for small multilingual models, showing their effectiveness in low-resource language tasks with limited data.
Findings
Sequential Bottleneck adapters excel in language modeling.
Invertible Bottleneck adapters outperform others on downstream tasks.
Small models outperform large LLMs in low-resource language adaptation.
Abstract
Low-resource languages (LRLs) face significant challenges in natural language processing (NLP) due to limited data. While current state-of-the-art large language models (LLMs) still struggle with LRLs, smaller multilingual models (mLMs) such as mBERT and XLM-R offer greater promise due to a better fit of their capacity to low training data sizes. This study systematically investigates parameter-efficient adapter-based methods for adapting mLMs to LRLs, evaluating three architectures: Sequential Bottleneck, Invertible Bottleneck, and Low-Rank Adaptation. Using unstructured text from GlotCC and structured knowledge from ConceptNet, we show that small adaptation datasets (e.g., up to 1 GB of free-text or a few MB of knowledge graph data) yield gains in intrinsic (masked language modeling) and extrinsic tasks (topic classification, sentiment analysis, and named entity recognition). We find…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗DGurgurov/mbert_slv-latnmodel· 2 dl2 dl
- 🤗DGurgurov/mbert_mlt-latnmodel
- 🤗DGurgurov/mbert_uzn-latnmodel· 1 dl1 dl
- 🤗DGurgurov/mbert_mar-devamodel· 2 dl2 dl
- 🤗DGurgurov/mbert_lvs-latnmodel
- 🤗DGurgurov/mbert_mkd-cyrlmodel· 1 dl1 dl
- 🤗DGurgurov/mbert_ben-bengmodel· 2 dl2 dl
- 🤗DGurgurov/mbert_bod-tibtmodel· 1 dl1 dl
- 🤗DGurgurov/mbert_uig-arabmodel· 2 dl2 dl
- 🤗DGurgurov/mbert_yor-latnmodel· 1 dl1 dl
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
