Learning meters of Arabic and English poems with Recurrent Neural Networks: a step forward for language understanding and synthesis
Waleed A. Yousef, Omar M. Ibrahime, Taha M. Madbouly, Moustafa A., Mahmoud

TL;DR
This paper develops RNN models to classify Arabic and English poem meters directly from raw text, achieving high accuracy and providing publicly available datasets for future research in language understanding and synthesis.
Contribution
It introduces the first machine learning approach using RNNs for poem meter classification and releases the largest publicly available dataset for this task.
Findings
Achieved 96.38% accuracy on Arabic poem meters
Achieved 82.31% accuracy on English poem meters
Provided publicly available, structured datasets for future research
Abstract
Recognizing a piece of writing as a poem or prose is usually easy for the majority of people; however, only specialists can determine which meter a poem belongs to. In this paper, we build Recurrent Neural Network (RNN) models that can classify poems according to their meters from plain text. The input text is encoded at the character level and directly fed to the models without feature handcrafting. This is a step forward for machine understanding and synthesis of languages in general, and Arabic language in particular. Among the 16 poem meters of Arabic and the 4 meters of English the networks were able to correctly classify poem with an overall accuracy of 96.38\% and 82.31\% respectively. The poem datasets used to conduct this research were massive, over 1.5 million of verses, and were crawled from different nontechnical sources, almost Arabic and English literature sites, and in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗CAMeL-Lab/bert-base-arabic-camelbert-ca-poetrymodel· 13 dl· ♡ 413 dl♡ 4
- 🤗CAMeL-Lab/bert-base-arabic-camelbert-da-poetrymodel· 9 dl9 dl
- 🤗CAMeL-Lab/bert-base-arabic-camelbert-mix-poetrymodel· 10 dl10 dl
- 🤗CAMeL-Lab/bert-base-arabic-camelbert-msa-poetrymodel· 11 dl· ♡ 111 dl♡ 1
- 🤗Yah216/Arabic_poem_meter_3model· 2 dl· ♡ 12 dl♡ 1
- 🤗Yah216/Poem_Qafiyah_Detectionmodel· 4 dl4 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsTanh Activation · Sigmoid Activation · Gated Recurrent Unit · Bidirectional GRU · Long Short-Term Memory · Bidirectional LSTM
