Multilingual Large Language Models Are Not (Yet) Code-Switchers

Ruochen Zhang; Samuel Cahyawijaya; Jan Christian Blaise Cruz; Genta; Indra Winata; Alham Fikri Aji

arXiv:2305.14235·cs.CL·October 24, 2023·2 cites

Multilingual Large Language Models Are Not (Yet) Code-Switchers

Ruochen Zhang, Samuel Cahyawijaya, Jan Christian Blaise Cruz, Genta, Indra Winata, Alham Fikri Aji

PDF

Open Access

TL;DR

This paper empirically evaluates multilingual large language models' ability to handle code-switching, revealing they underperform compared to smaller fine-tuned models and highlighting the need for targeted research in this area.

Contribution

It provides a comprehensive benchmark of multilingual LLMs on code-switching tasks, demonstrating their limitations and the gap between multilingual capabilities and code-switching proficiency.

Findings

01

Multilingual LLMs perform variably across tasks with zero/few-shot prompting.

02

They underperform compared to smaller fine-tuned models in code-switching tasks.

03

Current multilingualism in LLMs does not ensure effective handling of code-switching texts.

Abstract

Multilingual Large Language Models (LLMs) have recently shown great capabilities in a wide range of tasks, exhibiting state-of-the-art performance through zero-shot or few-shot prompting methods. While there have been extensive studies on their abilities in monolingual tasks, the investigation of their potential in the context of code-switching (CSW), the practice of alternating languages within an utterance, remains relatively uncharted. In this paper, we provide a comprehensive empirical analysis of various multilingual LLMs, benchmarking their performance across four tasks: sentiment analysis, machine translation, summarization and word-level language identification. Our results indicate that despite multilingual LLMs exhibiting promising outcomes in certain tasks using zero or few-shot prompting, they still underperform in comparison to fine-tuned models of much smaller scales. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis