Exploring In-context Example Generation for Machine Translation
Dohyun Lee, Seungil Chad Lee, Chanwoo Yang, Yujin Baek, Jaegul Choo

TL;DR
This paper introduces Demonstration Augmentation for Translation (DAT), a method that generates in-context examples for machine translation without external resources, improving translation quality especially for low-resource languages.
Contribution
It proposes a novel in-context example generation approach for machine translation that does not depend on human-annotated data, addressing low-resource language challenges.
Findings
DAT outperforms baselines in low-resource language translation
Generated example pairs improve translation quality
Progressive accumulation of pairs enhances performance
Abstract
Large language models (LLMs) have demonstrated strong performance across various tasks, leveraging their exceptional in-context learning ability with only a few examples. Accordingly, the selection of optimal in-context examples has been actively studied in the field of machine translation. However, these studies presuppose the presence of a demonstration pool with human-annotated pairs, making them less applicable to low-resource languages where such an assumption is challenging to meet. To overcome this limitation, this paper explores the research direction of in-context example generation for machine translation. Specifically, we propose Demonstration Augmentation for Translation (DAT), a simple yet effective approach that generates example pairs without relying on any external resources. This method builds upon two prior criteria, relevance and diversity, which have been highlighted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
