Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?
Seth Aycock, David Stap, Di Wu, Christof Monz, Khalil Sima'an

TL;DR
This paper investigates how low-resource languages can be translated using LLMs and finds that parallel examples are more effective than grammatical explanations, emphasizing the importance of task-specific data collection.
Contribution
The study demonstrates that parallel examples, rather than grammatical explanations, primarily drive translation improvements for low-resource languages in LLMs.
Findings
Parallel examples are the main source of translation improvements.
Fine-tuning models with parallel data achieves comparable results to LLM prompting.
Grammatical explanations do not significantly enhance translation performance.
Abstract
Extremely low-resource (XLR) languages lack substantial corpora for training NLP models, motivating the use of all available resources such as dictionaries and grammar books. Machine Translation from One Book (Tanzer et al., 2024) suggests that prompting long-context LLMs with one grammar book enables English-Kalamang translation, an XLR language unseen by LLMs - a noteworthy case of linguistics helping an NLP task. We investigate the source of this translation ability, finding almost all improvements stem from the book's parallel examples rather than its grammatical explanations. We find similar results for Nepali and Guarani, seen low-resource languages, and we achieve performance comparable to an LLM with a grammar book by simply fine-tuning an encoder-decoder translation model. We then investigate where grammar books help by testing two linguistic tasks, grammaticality judgment and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Mathematics, Computing, and Information Processing
