LLMs Are Zero-Shot Context-Aware Simultaneous Translators

Roman Koshkin; Katsuhito Sudoh; Satoshi Nakamura

arXiv:2406.13476·cs.CL·June 26, 2024

LLMs Are Zero-Shot Context-Aware Simultaneous Translators

Roman Koshkin, Katsuhito Sudoh, Satoshi Nakamura

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper demonstrates that open-source large language models can perform zero-shot simultaneous translation effectively, especially with minimal background info, highlighting their potential for multilingual, context-aware translation without extensive training.

Contribution

It shows that open-source LLMs can achieve competitive zero-shot simultaneous translation and improve with minimal background info, reducing the need for resource-intensive training.

Findings

01

LLMs perform on par or better than state-of-the-art in zero-shot SiMT

02

Minimal background info enhances translation performance

03

LLMs enable resource-efficient, multilingual, context-aware SiMT systems

Abstract

The advent of transformers has fueled progress in machine translation. More recently large language models (LLMs) have come to the spotlight thanks to their generality and strong performance in a wide range of language tasks, including translation. Here we show that open-source LLMs perform on par with or better than some state-of-the-art baselines in simultaneous machine translation (SiMT) tasks, zero-shot. We also demonstrate that injection of minimal background information, which is easy with an LLM, brings further performance gains, especially on challenging technical subject-matter. This highlights LLMs' potential for building next generation of massively multilingual, context-aware and terminologically accurate SiMT systems that require no resource-intensive training or fine-tuning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

romankoshkin/tollmatch
jaxOfficial

Videos

LLMs Are Zero-Shot Context-Aware Simultaneous Translators· underline

Taxonomy

TopicsNatural Language Processing Techniques