Exploring In-context Example Generation for Machine Translation

Dohyun Lee; Seungil Chad Lee; Chanwoo Yang; Yujin Baek; Jaegul Choo

arXiv:2506.00507·cs.CL·June 3, 2025

TL;DR

This paper introduces Demonstration Augmentation for Translation (DAT), a method that generates in-context examples for machine translation without relying on external resources, improving translation quality, especially for low-resource languages.

Contribution

The paper proposes a novel in-context example generation approach for machine translation that does not depend on human-annotated data, which makes it applicable to low-resource languages where an annotated demonstration pool is hard to obtain.

Findings

01 DAT outperforms baselines in low-resource language translation.

02 Generated example pairs improve translation quality.

03 Progressive accumulation of pairs enhances performance.

Abstract

Large language models (LLMs) have demonstrated strong performance across various tasks, leveraging their exceptional in-context learning ability with only a few examples. Accordingly, the selection of optimal in-context examples has been actively studied in the field of machine translation. However, these studies presuppose the presence of a demonstration pool with human-annotated pairs, making them less applicable to low-resource languages where such an assumption is challenging to meet. To overcome this limitation, this paper explores the research direction of in-context example generation for machine translation. Specifically, we propose Demonstration Augmentation for Translation (DAT), a simple yet effective approach that generates example pairs without relying on any external resources. This method builds upon two prior criteria, relevance and diversity, which have been highlighted…
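
To make the two criteria named in the abstract more concrete, the following is a minimal sketch of how relevance and diversity could be traded off when choosing in-context example pairs and assembling a few-shot translation prompt. It is not the DAT procedure itself, whose details are cut off above. It assumes a candidate list of (source, translation) pairs is already available, uses token-level Jaccard overlap as a crude stand-in for a relevance measure, and applies a greedy selection in the style of maximal marginal relevance (MMR). The names `jaccard`, `select_examples`, `build_prompt`, and the weight `lam` are hypothetical and chosen for illustration only.

```python
from typing import List, Tuple

Pair = Tuple[str, str]  # (source sentence, translation)


def jaccard(a: str, b: str) -> float:
    """Token-level Jaccard similarity (an assumed, simplistic relevance proxy)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def select_examples(query: str, candidates: List[Pair],
                    k: int = 2, lam: float = 0.5) -> List[Pair]:
    """Greedy MMR-style selection (illustrative, not the paper's algorithm).

    Each step picks the candidate that maximizes
    lam * relevance_to_query - (1 - lam) * similarity_to_already_selected.
    """
    selected: List[Pair] = []
    remaining = list(candidates)
    while remaining and len(selected) < k:
        def score(pair: Pair) -> float:
            relevance = jaccard(query, pair[0])
            # Redundancy is the highest overlap with any source chosen so far.
            redundancy = max((jaccard(pair[0], s[0]) for s in selected),
                             default=0.0)
            return lam * relevance - (1 - lam) * redundancy

        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


def build_prompt(query: str, examples: List[Pair],
                 src_lang: str, tgt_lang: str) -> str:
    """Format the chosen pairs as few-shot demonstrations, then the query."""
    blocks = [f"{src_lang}: {s}\n{tgt_lang}: {t}" for s, t in examples]
    blocks.append(f"{src_lang}: {query}\n{tgt_lang}:")
    return "\n\n".join(blocks)


if __name__ == "__main__":
    # Placeholder translations; no real translated text is assumed here.
    pool = [
        ("the cat sat on the mat", "<translation 1>"),
        ("the cat sat on the rug", "<translation 2>"),
        ("a dog ran in the park", "<translation 3>"),
    ]
    chosen = select_examples("the cat sat on the sofa", pool, k=2, lam=0.3)
    print(build_prompt("the cat sat on the sofa", chosen, "English", "Korean"))
```

With `lam=1.0` the selection is driven purely by relevance and tends to pick near-duplicate sentences. Lowering `lam` penalizes overlap with pairs already chosen, so the second pick shifts toward a less similar sentence.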

Taxonomy

Topics: Natural Language Processing Techniques