Don't Trust ChatGPT when Your Question is not in English: A Study of   Multilingual Abilities and Types of LLMs

Xiang Zhang; Senyu Li; Bradley Hauer; Ning Shi; Grzegorz Kondrak

arXiv:2305.16339·cs.CL·October 25, 2023·6 cites

Don't Trust ChatGPT when Your Question is not in English: A Study of Multilingual Abilities and Types of LLMs

Xiang Zhang, Senyu Li, Bradley Hauer, Ning Shi, Grzegorz Kondrak

PDF

Open Access

TL;DR

This paper investigates the multilingual abilities of Large Language Models, revealing their translation-like behavior and performance disparities across languages, using a novel back-translation prompting method to analyze how training data influences multilingual competence.

Contribution

It introduces a systematic approach to evaluate multilingual performance disparities and explores how insufficient training data affects LLMs' cross-language generalization.

Findings

01

GPT exhibits translation-like behavior in multilingual tasks

02

Performance varies significantly across different languages

03

Back-translation prompts reveal insights into LLMs' multilingual capabilities

Abstract

Large Language Models (LLMs) have demonstrated exceptional natural language understanding abilities and have excelled in a variety of natural language processing (NLP)tasks in recent years. Despite the fact that most LLMs are trained predominantly in English, multiple studies have demonstrated their comparative performance in many other languages. However, fundamental questions persist regarding how LLMs acquire their multi-lingual abilities and how performance varies across different languages. These inquiries are crucial for the study of LLMs since users and researchers often come from diverse language backgrounds, potentially influencing their utilization and interpretation of LLMs' results. In this work, we propose a systematic way of qualifying the performance disparities of LLMs under multilingual settings. We investigate the phenomenon of across-language generalizations in LLMs,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Natural Language Processing Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Cosine Annealing · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Residual Connection · Linear Warmup With Cosine Annealing