# EMSA: Explainable multilingual sentiment analysis models providing sentiment analysis across multiple languages

**Authors:** Li Zhao, Jinwei Zhou, Jinde Cao, Weina Zhu

PMC · DOI: 10.1371/journal.pone.0333508 · PLOS One · 2025-11-12

## TL;DR

EMSA is a new framework for multilingual sentiment analysis that improves accuracy and provides clear explanations for its predictions.

## Contribution

EMSA introduces a two-stage process combining large language models and prompt engineering for interpretable multilingual sentiment analysis.

## Key findings

- EMSA outperforms models like RoBERTa, XLNet, and ALBERT in sentiment classification.
- The model provides transparent reasoning steps, increasing user trust in its predictions.
- It performs well on both Chinese financial and English benchmark datasets.

## Abstract

Sentiment analysis across multiple languages remains a challenging problem due to linguistic diversity, domain-specific expressions, and the limited explainability of existing models. This study aims to address these issues by proposing the Explainable Multilingual Sentiment Analyzer (EMSA), a novel framework that integrates large language models with prompt engineering. EMSA employs a two-stage process, first generating sentiment reasoning through chain-of-thought prompts, and then producing sentiment classification with explicit interpretability. We evaluate EMSA on both the GubaSenti dataset (Chinese financial domain) and the SST dataset (English benchmark). Experimental results demonstrate that EMSA consistently outperforms pre-trained language models such as RoBERTa, XLNet, and ALBERT, while providing transparent reasoning steps that enhance user trust. These findings suggest that EMSA not only improves multilingual sentiment classification performance but also contributes to the development of more interpretable and practical sentiment analysis systems.

## Full-text entities

- **Genes:** SST (somatostatin) [NCBI Gene 6750] {aka SMST, SST1}
- **Diseases:** PLMs (MESH:D000095027), LLMs (MESH:D007806)
- **Chemicals:** PLM (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12611133/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12611133/full.md

## References

47 references — full list in the complete paper: https://tomesphere.com/paper/PMC12611133/full.md

---
Source: https://tomesphere.com/paper/PMC12611133