Adaptive Machine Translation with Large Language Models
Yasmin Moslem, Rejwanul Haque, John D. Kelleher, Andy Way

TL;DR
This paper explores how large language models can be used for real-time adaptive machine translation through in-context learning, improving translation quality by leveraging domain-specific prompts without additional fine-tuning.
Contribution
It demonstrates that large language models can adapt to specific domains and terminology in real-time, surpassing some traditional MT systems in translation quality for high-resource languages.
Findings
LLMs can adapt to in-domain data during translation without fine-tuning.
Few-shot in-context learning can outperform strong encoder-decoder MT systems.
Combining encoder-decoder MT with fuzzy matching improves translation for less supported languages.
Abstract
Consistency is a key requirement of high-quality translation. It is especially important to adhere to pre-approved terminology and adapt to corrected translations in domain-specific projects. Machine translation (MT) has achieved significant progress in the area of domain adaptation. However, real-time adaptation remains challenging. Large-scale language models (LLMs) have recently shown interesting capabilities of in-context learning, where they learn to replicate certain input-output text generation patterns, without further fine-tuning. By feeding an LLM at inference time with a prompt that consists of a list of translation pairs, it can then simulate the domain and style characteristics. This work aims to investigate how we can utilize in-context learning to improve real-time adaptive MT. Our extensive experiments show promising results at translation time. For example, LLMs can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Layer Normalization · Weight Decay · Multi-Head Attention · Residual Connection · {Dispute@FaQ-s}How to file a dispute with Expedia? · Dense Connections
