Prompting Large Language Model for Machine Translation: A Case Study
Biao Zhang, Barry Haddow, Alexandra Birch

TL;DR
This paper systematically investigates prompting strategies for machine translation using large language models, exploring factors like prompt quality, example selection, and transfer learning, and analyzing their impact on translation performance.
Contribution
It provides the first comprehensive study on prompting for machine translation, examining various factors and transfer learning techniques to improve translation quality with large language models.
Findings
Prompt example quality significantly affects translation performance.
Semantic similarity of prompt examples correlates with prompting success.
Monolingual data and transfer learning can enhance translation results.
Abstract
Research on prompting has shown excellent performance with little or even no supervised training across many tasks. However, prompting for machine translation is still under-explored in the literature. We fill this gap by offering a systematic study on prompting strategies for translation, examining various factors for prompt template and demonstration example selection. We further explore the use of monolingual data and the feasibility of cross-lingual, cross-domain, and sentence-to-document transfer learning in prompting. Extensive experiments with GLM-130B (Zeng et al., 2022) as the testbed show that 1) the number and the quality of prompt examples matter, where using suboptimal examples degenerates translation; 2) several features of prompt examples, such as semantic similarity, show significant Spearman correlation with their prompting performance; yet, none of the correlations are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsNone
