YMIR: A new Benchmark Dataset and Model for Arabic Yemeni Music Genre Classification Using Convolutional Neural Networks
Moeen AL-Makhlafi, Abdulrahman A. AlKannad, Eiad Almekhlafi, Nawaf Q. Othman Ahmed Mohammed, Saher Qaid

TL;DR
This paper introduces YMIR, a new Yemeni music dataset, and YMCM, a CNN-based model, demonstrating high accuracy in classifying traditional Yemeni music genres, thus addressing cultural underrepresentation in music retrieval.
Contribution
The paper presents the first curated Yemeni music dataset and a CNN model tailored for genre classification, establishing new benchmarks for culturally specific music analysis.
Findings
YMCM achieved 98.8% accuracy with Mel-spectrograms.
The dataset has high inter-annotator agreement (Fleiss kappa = 0.85).
Mel-spectrogram features outperform other representations.
Abstract
Automatic music genre classification is a major task in music information retrieval; however, most current benchmarks and models have been developed primarily for Western music, leaving culturally specific traditions underrepresented. In this paper, we introduce the Yemeni Music Information Retrieval (YMIR) dataset, which contains 1,475 carefully selected audio clips covering five traditional Yemeni genres: Sanaani, Hadhrami, Lahji, Tihami, and Adeni. The dataset was labeled by five Yemeni music experts following a clear and structured protocol, resulting in strong inter-annotator agreement (Fleiss kappa = 0.85). We also propose the Yemeni Music Classification Model (YMCM), a convolutional neural network (CNN)-based system designed to classify music genres from time-frequency features. Using a consistent preprocessing pipeline, we perform a systematic comparison across six experimental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
