# Dhati+: Fine-tuned Large Language Models for Arabic Subjectivity Evaluation

**Authors:** Slimane Bellaouar, Attia Nehar, Soumia Souffi, Mounia Bouameur

arXiv: 2508.19966 · 2026-03-02

## TL;DR

This paper introduces AraDhati+, a new annotated dataset for Arabic subjectivity analysis, and fine-tunes multiple large language models, achieving high accuracy in classifying Arabic subjective texts despite resource limitations.

## Contribution

It presents a comprehensive Arabic subjectivity dataset and demonstrates effective fine-tuning of state-of-the-art models, including ensemble methods, for improved Arabic text classification.

## Key findings

- Achieved 97.79% accuracy in Arabic subjectivity classification.
- Developed AraDhati+ dataset from multiple sources.
- Ensemble approach enhances model performance.

## Abstract

Despite its significance, Arabic, a linguistically rich and morphologically complex language, faces the challenge of being under-resourced. The scarcity of large annotated datasets hampers the development of accurate tools for subjectivity analysis in Arabic. Recent advances in deep learning and Transformers have proven highly effective for text classification in English and French. This paper proposes a new approach for subjectivity assessment in Arabic textual data. To address the dearth of specialized annotated datasets, we developed a comprehensive dataset, AraDhati+, by leveraging existing Arabic datasets and collections (ASTD, LABR, HARD, and SANAD). Subsequently, we fine-tuned state-of-the-art Arabic language models (XLM-RoBERTa, AraBERT, and ArabianGPT) on AraDhati+ for effective subjectivity classification. Furthermore, we experimented with an ensemble decision approach to harness the strengths of individual models. Our approach achieves a remarkable accuracy of 97.79\,\% for Arabic subjectivity classification. Results demonstrate the effectiveness of the proposed approach in addressing the challenges posed by limited resources in Arabic language processing.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2508.19966/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/2508.19966/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/2508.19966/full.md

---
Source: https://tomesphere.com/paper/2508.19966