Data-Efficient Domain Adaptation for LLM-based MT using Contrastive Preference Optimization

Inacio Vieira; Antonio Castaldo; James O'Doherty; Sheila Castilho

arXiv:2510.27556·cs.CL·November 3, 2025

Data-Efficient Domain Adaptation for LLM-based MT using Contrastive Preference Optimization

Inacio Vieira, Antonio Castaldo, James O'Doherty, Sheila Castilho

PDF

Open Access

TL;DR

This paper introduces a data-efficient domain adaptation method for large language model-based machine translation using contrastive preference optimization, achieving high performance with significantly less data.

Contribution

It presents a novel application of contrastive preference optimization for domain adaptation, reducing data requirements in machine translation tasks.

Findings

01

Achieves near SFT performance with only 14.7k preference pairs

02

Demonstrates effectiveness across English-Brazilian Portuguese and English-Korean translation

03

Generalizes to other generative tasks using contrastive signals

Abstract

LLMs often require adaptation to domain-specific requirements, a process that can be expensive when relying solely on SFT. We present an empirical study on applying CPO to simulate a post-editing workflow for data-efficient domain adaptation. Our approach synthesizes preference pairs by treating the base model's own raw output as the 'rejected' translation and the human-approved TM entry as the 'chosen' one. This method provides direct feedback on the model's current knowledge, guiding it to align with domain-specific standards. Experiments in English-Brazilian Portuguese and English-Korean show that, by using just 14.7k preference pairs, the model achieves performance close to that of a model trained on 160k+ samples with SFT, demonstrating significant data efficiency. Although we showcase its effectiveness in MT, this application of CPO naturally generalizes to other generative tasks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis