A Comparative Evaluation of Large Language Models for Persian Sentiment Analysis and Emotion Detection in Social Media Texts

Kian Tohidi; Kia Dashtipour; Simone Rebora; Sevda Pourfaramarz

arXiv:2509.14922·cs.CL·September 19, 2025·2 cites

A Comparative Evaluation of Large Language Models for Persian Sentiment Analysis and Emotion Detection in Social Media Texts

Kian Tohidi, Kia Dashtipour, Simone Rebora, Sevda Pourfaramarz

PDF

Open Access

TL;DR

This paper compares four large language models on Persian social media sentiment and emotion detection tasks, revealing performance benchmarks, efficiency considerations, and linguistic challenges in Persian NLP applications.

Contribution

It provides the first comprehensive, fair comparison of LLMs on Persian sentiment and emotion analysis, highlighting performance, cost-efficiency, and linguistic challenges.

Findings

01

GPT-4o achieved marginally higher accuracy.

02

Gemini 2.0 Flash was most cost-efficient.

03

Emotion detection is more challenging than sentiment analysis.

Abstract

This study presents a comprehensive comparative evaluation of four state-of-the-art Large Language Models (LLMs)--Claude 3.7 Sonnet, DeepSeek-V3, Gemini 2.0 Flash, and GPT-4o--for sentiment analysis and emotion detection in Persian social media texts. Comparative analysis among LLMs has witnessed a significant rise in recent years, however, most of these analyses have been conducted on English language tasks, creating gaps in understanding cross-linguistic performance patterns. This research addresses these gaps through rigorous experimental design using balanced Persian datasets containing 900 texts for sentiment analysis (positive, negative, neutral) and 1,800 texts for emotion detection (anger, fear, happiness, hate, sadness, surprise). The main focus was to allow for a direct and fair comparison among different models, by using consistent prompts, uniform processing parameters, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Mental Health via Writing · Hate Speech and Cyberbullying Detection