ChatGPT MT: Competitive for High- (but not Low-) Resource Languages

Nathaniel R. Robinson; Perez Ogayo; David R. Mortensen; Graham; Neubig

arXiv:2309.07423·cs.CL·September 15, 2023·5 cites

ChatGPT MT: Competitive for High- (but not Low-) Resource Languages

Nathaniel R. Robinson, Perez Ogayo, David R. Mortensen, Graham, Neubig

PDF

Open Access 2 Repos

TL;DR

This study evaluates ChatGPT's machine translation performance across 204 languages, finding it excels in high-resource languages but underperforms in low-resource and African languages, highlighting resource level as a key factor.

Contribution

First comprehensive evaluation of ChatGPT's MT capabilities across a wide range of languages, revealing resource-dependent performance disparities.

Findings

01

GPT approaches or exceeds traditional MT in high-resource languages.

02

GPT underperforms in 84.1% of low-resource languages.

03

Resource level is the primary factor affecting translation quality.

Abstract

Large language models (LLMs) implicitly learn to perform a range of language tasks, including machine translation (MT). Previous studies explore aspects of LLMs' MT capabilities. However, there exist a wide variety of languages for which recent LLM MT performance has never before been evaluated. Without published experimental evidence on the matter, it is difficult for speakers of the world's diverse languages to know how and whether they can use LLMs for their languages. We present the first experimental evidence for an expansive set of 204 languages, along with MT cost analysis, using the FLORES-200 benchmark. Trends reveal that GPT models approach or exceed traditional MT model performance for some high-resource languages (HRLs) but consistently lag for low-resource languages (LRLs), under-performing traditional MT for 84.1% of languages we covered. Our analysis reveals that a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsMulti-Head Attention · Attention Is All You Need · Cosine Annealing · Linear Layer · Linear Warmup With Cosine Annealing · Dense Connections · Layer Normalization · Dropout · Attention Dropout · Discriminative Fine-Tuning