Can LLMs Faithfully Explain Themselves in Low-Resource Languages? A Case Study on Emotion Detection in Persian

Mobina Mehrazar; Mohammad Amin Yousefi; Parisa Abolfath Beygi; Behnam Bahrak

arXiv:2511.19719·cs.CL·November 26, 2025

Can LLMs Faithfully Explain Themselves in Low-Resource Languages? A Case Study on Emotion Detection in Persian

Mobina Mehrazar, Mohammad Amin Yousefi, Parisa Abolfath Beygi, Behnam Bahrak

PDF

Open Access

TL;DR

This paper investigates whether large language models provide faithful self-explanations in low-resource languages, specifically Persian, revealing that current methods often produce explanations that do not align well with human reasoning despite good classification performance.

Contribution

The study evaluates the faithfulness of LLM explanations in Persian emotion detection, comparing prompting strategies and highlighting limitations in current explanation methods for low-resource languages.

Findings

01

LLMs perform well in emotion classification in Persian.

02

Generated explanations often diverge from human reasoning.

03

Explanation faithfulness is influenced by prompting strategies.

Abstract

Large language models (LLMs) are increasingly used to generate self-explanations alongside their predictions, a practice that raises concerns about the faithfulness of these explanations, especially in low-resource languages. This study evaluates the faithfulness of LLM-generated explanations in the context of emotion classification in Persian, a low-resource language, by comparing the influential words identified by the model against those identified by human annotators. We assess faithfulness using confidence scores derived from token-level log-probabilities. Two prompting strategies, differing in the order of explanation and prediction (Predict-then-Explain and Explain-then-Predict), are tested for their impact on explanation faithfulness. Our results reveal that while LLMs achieve strong classification performance, their generated explanations often diverge from faithful reasoning,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Sentiment Analysis and Opinion Mining · Topic Modeling