Automatic Classification of Arabic Literature into Historical Eras
Zainab Alhathloul, Irfan Ahmad

TL;DR
This study develops neural network models to automatically classify Arabic texts into historical eras, demonstrating promising results for binary classification and highlighting challenges in multi-class era classification.
Contribution
It introduces the first deep learning approach for automatic Arabic text classification into multiple historical periods beyond poetry.
Findings
Binary classification F1-scores of 0.83 and 0.79
Multi-class classification F1-scores around 0.20
Models show potential but face challenges with finer era distinctions
Abstract
The Arabic language has undergone notable transformations over time, including the emergence of new vocabulary, the obsolescence of others, and shifts in word usage. This evolution is evident in the distinction between the classical and modern Arabic eras. Although historians and linguists have partitioned Arabic literature into multiple eras, relatively little research has explored the automatic classification of Arabic texts by time period, particularly beyond the domain of poetry. This paper addresses this gap by employing neural networks and deep learning techniques to automatically classify Arabic texts into distinct eras and periods. The proposed models are evaluated using two datasets derived from two publicly available corpora, covering texts from the pre-Islamic to the modern era. The study examines class setups ranging from binary to 15-class classification and considers both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Text and Document Classification Technologies · Topic Modeling
