Towards Effective Disambiguation for Machine Translation with Large   Language Models

Vivek Iyer; Pinzhen Chen; Alexandra Birch

arXiv:2309.11668·cs.CL·October 24, 2023

Towards Effective Disambiguation for Machine Translation with Large Language Models

Vivek Iyer, Pinzhen Chen, Alexandra Birch

PDF

Open Access

TL;DR

This paper investigates the use of large language models for disambiguation in machine translation, proposing methods to enhance their ability to handle ambiguous sentences, and demonstrating competitive performance against state-of-the-art systems.

Contribution

It introduces two techniques— in-context learning and fine-tuning on curated datasets— to improve LLMs' disambiguation capabilities in machine translation.

Findings

01

Methods match or outperform DeepL and NLLB in four language directions.

02

Curated datasets and resources are publicly released.

03

Provides insights into adapting LLMs for better disambiguation in MT.

Abstract

Resolving semantic ambiguity has long been recognised as a central challenge in the field of Machine Translation. Recent work on benchmarking translation performance on ambiguous sentences has exposed the limitations of conventional Neural Machine Translation (NMT) systems, which fail to handle many such cases. Large language models (LLMs) have emerged as a promising alternative, demonstrating comparable performance to traditional NMT models while introducing new paradigms for controlling the target outputs. In this paper, we study the capabilities of LLMs to translate "ambiguous sentences" - i.e. those containing highly polysemous words and/or rare word senses. We also propose two ways to improve their disambiguation capabilities, through a) in-context learning and b) fine-tuning on carefully curated ambiguous datasets. Experiments show that our methods can match or outperform…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

Methodsfail