Do Large Language Models Possess Sensitive to Sentiment?

Yang Liu; Xichou Zhu; Zhou Shen; Yi Liu; Min Li; Yujun Chen; Benzi; John; Zhenzhen Ma; Tao Hu; Zhi Li; Zhiyang Xu; Wei Luo; Junhui Wang

arXiv:2409.02370·cs.CL·February 17, 2025

Do Large Language Models Possess Sensitive to Sentiment?

Yang Liu, Xichou Zhu, Zhou Shen, Yi Liu, Min Li, Yujun Chen, Benzi, John, Zhenzhen Ma, Tao Hu, Zhi Li, Zhiyang Xu, Wei Luo, Junhui Wang

PDF

Open Access

TL;DR

This paper evaluates the sentiment detection capabilities of large language models, revealing their basic sensitivity but highlighting significant accuracy and consistency gaps, especially with subtle or sarcastic sentiments.

Contribution

It provides a comprehensive assessment of LLMs' sentiment understanding, identifying key performance gaps and variability across models, guiding future improvements.

Findings

01

LLMs show basic sentiment sensitivity but with notable accuracy gaps.

02

Models often misclassify strong sentiments and sarcasm.

03

Performance varies significantly across different LLM architectures.

Abstract

Large Language Models (LLMs) have recently displayed their extraordinary capabilities in language understanding. However, how to comprehensively assess the sentiment capabilities of LLMs continues to be a challenge. This paper investigates the ability of LLMs to detect and react to sentiment in text modal. As the integration of LLMs into diverse applications is on the rise, it becomes highly critical to comprehend their sensitivity to emotional tone, as it can influence the user experience and the efficacy of sentiment-driven tasks. We conduct a series of experiments to evaluate the performance of several prominent LLMs in identifying and responding appropriately to sentiments like positive, negative, and neutral emotions. The models' outputs are analyzed across various sentiment benchmarks, and their responses are compared with human evaluations. Our discoveries indicate that although…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsSparse Evolutionary Training