Don't Trust ChatGPT when Your Question is not in English: A Study of Multilingual Abilities and Types of LLMs
Xiang Zhang, Senyu Li, Bradley Hauer, Ning Shi, Grzegorz Kondrak

TL;DR
This paper investigates the multilingual abilities of Large Language Models, revealing their translation-like behavior and performance disparities across languages, using a novel back-translation prompting method to analyze how training data influences multilingual competence.
Contribution
It introduces a systematic approach to evaluate multilingual performance disparities and explores how insufficient training data affects LLMs' cross-language generalization.
Findings
GPT exhibits translation-like behavior in multilingual tasks
Performance varies significantly across different languages
Back-translation prompts reveal insights into LLMs' multilingual capabilities
Abstract
Large Language Models (LLMs) have demonstrated exceptional natural language understanding abilities and have excelled in a variety of natural language processing (NLP)tasks in recent years. Despite the fact that most LLMs are trained predominantly in English, multiple studies have demonstrated their comparative performance in many other languages. However, fundamental questions persist regarding how LLMs acquire their multi-lingual abilities and how performance varies across different languages. These inquiries are crucial for the study of LLMs since users and researchers often come from diverse language backgrounds, potentially influencing their utilization and interpretation of LLMs' results. In this work, we propose a systematic way of qualifying the performance disparities of LLMs under multilingual settings. We investigate the phenomenon of across-language generalizations in LLMs,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Natural Language Processing Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Cosine Annealing · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Residual Connection · Linear Warmup With Cosine Annealing
