Prompt, Translate, Fine-Tune, Re-Initialize, or Instruction-Tune? Adapting LLMs for In-Context Learning in Low-Resource Languages
Christopher Toukmaji, Jeffrey Flanigan

TL;DR
This study compares various adaptation techniques for low-resource language tasks in large language models, finding prompting methods outperform fine-tuning and introducing a new metric to analyze output quality degradation.
Contribution
It provides the largest comprehensive analysis of adaptation methods for low-resource languages in LLMs, including a novel metric and insights into catastrophic forgetting.
Findings
Few-shot prompting and translate-test outperform fine-tuning.
Catastrophic forgetting affects model outputs after training.
Largest study on low-resource language adaptation with diverse techniques.
Abstract
LLMs are typically trained in high-resource languages, and tasks in lower-resourced languages tend to underperform the higher-resource language counterparts for in-context learning. Despite the large body of work on prompting settings, it is still unclear how LLMs should be adapted cross-lingually specifically for in-context learning in the low-resource target languages. We perform a comprehensive study spanning five diverse target languages, three base LLMs, and seven downstream tasks spanning over 4,100 GPU training hours (9,900+ TFLOPs) across various adaptation techniques: few-shot prompting, translate-test, fine-tuning, embedding re-initialization, and instruction fine-tuning. Our results show that the few-shot prompting and translate-test settings tend to heavily outperform the gradient-based adaptation methods. To better understand this discrepancy, we design a novel metric,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗ChrisToukmaji/focus_bur_llama_focus_trainedmodel
- 🤗ChrisToukmaji/focus_bur_mpt_focus_trainedmodel· 7 dl7 dl
- 🤗ChrisToukmaji/focus_bur_phi_focus_trainedmodel· 10 dl10 dl
- 🤗ChrisToukmaji/focus_hau_llama_focus_trainedmodel· 1 dl1 dl
- 🤗ChrisToukmaji/focus_hau_phi_focus_trainedmodel· 9 dl9 dl
- 🤗ChrisToukmaji/focus_kin_llama_focus_trainedmodel
- 🤗ChrisToukmaji/focus_kin_mpt_focus_trainedmodel· 2 dl2 dl
- 🤗ChrisToukmaji/focus_kin_phi_focus_trainedmodel
- 🤗ChrisToukmaji/focus_lug_llama_focus_trainedmodel· 2 dl2 dl
- 🤗ChrisToukmaji/focus_lug_mpt_focus_trainedmodel· 2 dl2 dl
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Text Readability and Simplification · Topic Modeling
