ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large   Language Models in Multilingual Learning

Viet Dac Lai; Nghia Trung Ngo; Amir Pouran Ben Veyseh; Hieu Man,; Franck Dernoncourt; Trung Bui; Thien Huu Nguyen

arXiv:2304.05613·cs.CL·April 13, 2023·51 cites

ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning

Viet Dac Lai, Nghia Trung Ngo, Amir Pouran Ben Veyseh, Hieu Man,, Franck Dernoncourt, Trung Bui, Thien Huu Nguyen

PDF

Open Access

TL;DR

This paper evaluates ChatGPT's performance across 37 languages and 7 NLP tasks, revealing limitations in multilingual capabilities and highlighting the need for improved models in diverse language settings.

Contribution

It provides a comprehensive, zero-shot evaluation of ChatGPT on multiple languages and tasks, filling a gap in understanding its multilingual effectiveness.

Findings

01

ChatGPT performs worse than previous models on multilingual NLP tasks.

02

Performance varies significantly across languages with different resource levels.

03

Results highlight the need for further research to enhance multilingual capabilities.

Abstract

Over the last few years, large language models (LLMs) have emerged as the most important breakthroughs in natural language processing (NLP) that fundamentally transform research and developments in the field. ChatGPT represents one of the most exciting LLM systems developed recently to showcase impressive skills for language generation and highly attract public attention. Among various exciting applications discovered for ChatGPT in English, the model can process and generate texts for multiple languages due to its multilingual training data. Given the broad adoption of ChatGPT for English in different problems and areas, a natural question is whether ChatGPT can also be applied effectively for other languages or it is necessary to develop more language-specific technologies. The answer to this question requires a thorough evaluation of ChatGPT over multiple tasks with diverse languages…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · COVID-19 diagnosis using AI