LLMs Are Few-Shot In-Context Low-Resource Language Learners

Samuel Cahyawijaya; Holy Lovenia; Pascale Fung

arXiv:2403.16512·cs.CL·June 26, 2024·3 cites

LLMs Are Few-Shot In-Context Low-Resource Language Learners

Samuel Cahyawijaya, Holy Lovenia, Pascale Fung

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates the effectiveness of in-context learning with large language models for low-resource languages, proposing a new query alignment method to improve semantic understanding and cross-lingual transfer.

Contribution

It provides a comprehensive study of ICL in low-resource languages, introduces query alignment as an alternative to label alignment, and offers insights into enhancing LLM performance in underrepresented languages.

Findings

01

ICL significantly improves low-resource language understanding.

02

Query alignment outperforms label alignment in cross-lingual tasks.

03

Semantic relevance in in-context examples enhances model performance.

Abstract

In-context learning (ICL) empowers large language models (LLMs) to perform diverse tasks in underrepresented languages using only short in-context information, offering a crucial avenue for narrowing the gap between high-resource and low-resource languages. Nonetheless, there is only a handful of works explored ICL for low-resource languages with most of them focusing on relatively high-resource languages, such as French and Spanish. In this work, we extensively study ICL and its cross-lingual variation (X-ICL) on 25 low-resource and 7 relatively higher-resource languages. Our study not only assesses the effectiveness of ICL with LLMs in low-resource languages but also identifies the shortcomings of in-context label alignment, and introduces a more effective alternative: query alignment. Moreover, we provide valuable insights into various facets of ICL for low-resource languages. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

samuelcahyawijaya/in-context-alignment
pytorchOfficial

Videos

LLMs Are Few-Shot In-Context Low-Resource Language Learners· underline

Taxonomy

TopicsNatural Language Processing Techniques