Why Low-Resource NLP Needs More Than Cross-Lingual Transfer: Lessons Learned from Luxembourgish
Fred Philippy, Siwen Guo, Jacques Klein, Tegawend\'e F. Bissyand\'e

TL;DR
This paper examines the interplay between cross-lingual transfer and language-specific efforts in low-resource NLP, emphasizing their complementary roles and providing practical guidelines for Luxembourgish.
Contribution
It offers a nuanced analysis of how cross-lingual transfer and language-specific efforts jointly enhance low-resource NLP, with insights from Luxembourgish data.
Findings
Cross-lingual transfer improves low-resource language performance.
High-quality, task-aligned data is crucial for transfer success.
Resources alone are insufficient without cross-lingual frameworks.
Abstract
Cross-lingual transfer has become a central paradigm for extending natural language processing (NLP) technologies to low-resource languages. By leveraging supervision from high-resource languages, multilingual language models can achieve strong task performance with little or no labeled target-language data. However, it remains unclear to what extent cross-lingual transfer can substitute for language-specific efforts. In this paper, we synthesize prior research findings and data collection results on Luxembourgish, which, despite its typological proximity to high-resource languages and its presence in a multilingual context, remains insufficiently represented in modern NLP technologies. Across findings, we observe a fundamental interdependence between cross-lingual transfer and language-specific efforts. Cross-lingual transfer can substantially improve target-language performance, but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
