Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding
Tadesse Destaw Belay, Israel Abebe Azime, Abinew Ali Ayele, Grigori, Sidorov, Dietrich Klakow, Philipp Slusallek, Olga Kolesnikova, Seid Muhie, Yimam

TL;DR
This paper evaluates large language models' ability to classify multiple emotions across four Ethiopian languages and English, revealing significant performance gaps especially in low-resource languages and highlighting the need for further research.
Contribution
Introduces EthioEmo, a new multi-label emotion dataset for Ethiopian languages, and provides comprehensive evaluation of LLMs across multiple languages and model types.
Findings
High-resource languages like English show better emotion classification performance.
Significant performance gaps exist between high-resource and low-resource languages.
Varying results depending on language and model type highlight challenges in multilingual emotion understanding.
Abstract
Large Language Models (LLMs) show promising learning and reasoning abilities. Compared to other NLP tasks, multilingual and multi-label emotion evaluation tasks are under-explored in LLMs. In this paper, we present EthioEmo, a multi-label emotion classification dataset for four Ethiopian languages, namely, Amharic (amh), Afan Oromo (orm), Somali (som), and Tigrinya (tir). We perform extensive experiments with an additional English multi-label emotion dataset from SemEval 2018 Task 1. Our evaluation includes encoder-only, encoder-decoder, and decoder-only language models. We compare zero and few-shot approaches of LLMs to fine-tuning smaller language models. The results show that accurate multi-label emotion classification is still insufficient even for high-resource languages such as English, and there is a large gap between the performance of high-resource and low-resource languages.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Text and Document Classification Technologies · Topic Modeling
