Multilingual Machine Translation with Large Language Models: Empirical   Results and Analysis

Wenhao Zhu; Hongyi Liu; Qingxiu Dong; Jingjing Xu; Shujian Huang,; Lingpeng Kong; Jiajun Chen; Lei Li

arXiv:2304.04675·cs.CL·June 17, 2024·52 cites

Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis

Wenhao Zhu, Hongyi Liu, Qingxiu Dong, Jingjing Xu, Shujian Huang,, Lingpeng Kong, Jiajun Chen, Lei Li

PDF

Open Access 2 Repos 1 Datasets 1 Video

TL;DR

This paper systematically evaluates large language models like GPT-4 for multilingual machine translation, revealing their strengths, limitations, and unique working patterns across various languages and resource settings.

Contribution

It provides a comprehensive empirical analysis of LLMs' translation performance, highlighting new insights into their resource efficiency and exemplar usage for low-resource languages.

Findings

01

GPT-4 outperforms NLLB in 40.91% of directions

02

LLMs can generate moderate translation for zero-resource languages

03

Cross-lingual exemplars improve low-resource translation

Abstract

Large language models (LLMs) have demonstrated remarkable potential in handling multilingual machine translation (MMT). In this paper, we systematically investigate the advantages and challenges of LLMs for MMT by answering two questions: 1) How well do LLMs perform in translating massive languages? 2) Which factors affect LLMs' performance in translation? We thoroughly evaluate eight popular LLMs, including ChatGPT and GPT-4. Our empirical results show that translation capabilities of LLMs are continually involving. GPT-4 has beat the strong supervised baseline NLLB in 40.91% of translation directions but still faces a large gap towards the commercial translation system like Google Translate, especially on low-resource languages. Through further analysis, we discover that LLMs exhibit new working patterns when used for MMT. First, LLM can acquire translation ability in a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

LumiOpen/instruction-collection-fin
dataset· 106 dl
106 dl

Videos

Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Layer Normalization · Label Smoothing · Byte Pair Encoding · Dense Connections · Position-Wise Feed-Forward Layer · Residual Connection