YMIR: A new Benchmark Dataset and Model for Arabic Yemeni Music Genre Classification Using Convolutional Neural Networks

Moeen AL-Makhlafi; Abdulrahman A. AlKannad; Eiad Almekhlafi; Nawaf Q. Othman Ahmed Mohammed; Saher Qaid

arXiv:2604.05011·cs.SD·April 8, 2026

YMIR: A new Benchmark Dataset and Model for Arabic Yemeni Music Genre Classification Using Convolutional Neural Networks

Moeen AL-Makhlafi, Abdulrahman A. AlKannad, Eiad Almekhlafi, Nawaf Q. Othman Ahmed Mohammed, Saher Qaid

PDF

TL;DR

This paper introduces YMIR, a new Yemeni music dataset, and YMCM, a CNN-based model, demonstrating high accuracy in classifying traditional Yemeni music genres, thus addressing cultural underrepresentation in music retrieval.

Contribution

The paper presents the first curated Yemeni music dataset and a CNN model tailored for genre classification, establishing new benchmarks for culturally specific music analysis.

Findings

01

YMCM achieved 98.8% accuracy with Mel-spectrograms.

02

The dataset has high inter-annotator agreement (Fleiss kappa = 0.85).

03

Mel-spectrogram features outperform other representations.

Abstract

Automatic music genre classification is a major task in music information retrieval; however, most current benchmarks and models have been developed primarily for Western music, leaving culturally specific traditions underrepresented. In this paper, we introduce the Yemeni Music Information Retrieval (YMIR) dataset, which contains 1,475 carefully selected audio clips covering five traditional Yemeni genres: Sanaani, Hadhrami, Lahji, Tihami, and Adeni. The dataset was labeled by five Yemeni music experts following a clear and structured protocol, resulting in strong inter-annotator agreement (Fleiss kappa = 0.85). We also propose the Yemeni Music Classification Model (YMCM), a convolutional neural network (CNN)-based system designed to classify music genres from time-frequency features. Using a consistent preprocessing pipeline, we perform a systematic comparison across six experimental…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.