Prompt, Translate, Fine-Tune, Re-Initialize, or Instruction-Tune? Adapting LLMs for In-Context Learning in Low-Resource Languages

Christopher Toukmaji; Jeffrey Flanigan

arXiv:2506.19187·cs.CL·June 25, 2025

Prompt, Translate, Fine-Tune, Re-Initialize, or Instruction-Tune? Adapting LLMs for In-Context Learning in Low-Resource Languages

Christopher Toukmaji, Jeffrey Flanigan

PDF

Open Access 10 Models 5 Datasets 1 Video

TL;DR

This study compares various adaptation techniques for low-resource language tasks in large language models, finding prompting methods outperform fine-tuning and introducing a new metric to analyze output quality degradation.

Contribution

It provides the largest comprehensive analysis of adaptation methods for low-resource languages in LLMs, including a novel metric and insights into catastrophic forgetting.

Findings

01

Few-shot prompting and translate-test outperform fine-tuning.

02

Catastrophic forgetting affects model outputs after training.

03

Largest study on low-resource language adaptation with diverse techniques.

Abstract

LLMs are typically trained in high-resource languages, and tasks in lower-resourced languages tend to underperform the higher-resource language counterparts for in-context learning. Despite the large body of work on prompting settings, it is still unclear how LLMs should be adapted cross-lingually specifically for in-context learning in the low-resource target languages. We perform a comprehensive study spanning five diverse target languages, three base LLMs, and seven downstream tasks spanning over 4,100 GPU training hours (9,900+ TFLOPs) across various adaptation techniques: few-shot prompting, translate-test, fine-tuning, embedding re-initialization, and instruction fine-tuning. Our results show that the few-shot prompting and translate-test settings tend to heavily outperform the gradient-based adaptation methods. To better understand this discrepancy, we design a novel metric,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

Videos

Prompt, Translate, Fine-Tune, Re-Initialize, or Instruction-Tune? Adapting LLMs for In-Context Learning in Low-Resource Languages· underline

Taxonomy

TopicsNatural Language Processing Techniques · Text Readability and Simplification · Topic Modeling