Kinit Classification in Ethiopian Chants, Azmaris and Modern Music: A New Dataset and CNN Benchmark
Ephrem A. Retta, Richard Sutcliffe, Eiad Almekhlafi, Yosef K. Enku,, Eyob Alemu, Tigist D. Gemechu, Michael A. Berwo, Mustafa Mhamed, Jun Feng

TL;DR
This paper introduces EMIR, a new Ethiopian music dataset, and develops a CNN-based model that achieves high accuracy in classifying Ethiopian Kinit styles, combining scale and genre recognition.
Contribution
The paper presents the first Ethiopian music dataset EMIR and a CNN benchmark model for Kinit classification, demonstrating superior performance over existing models.
Findings
MFCC features outperform others for Kinit classification
3-second audio samples yield best results
EKM achieves 95% accuracy and fastest training time
Abstract
In this paper, we create EMIR, the first-ever Music Information Retrieval dataset for Ethiopian music. EMIR is freely available for research purposes and contains 600 sample recordings of Orthodox Tewahedo chants, traditional Azmari songs and contemporary Ethiopian secular music. Each sample is classified by five expert judges into one of four well-known Ethiopian Kinits, Tizita, Bati, Ambassel and Anchihoye. Each Kinit uses its own pentatonic scale and also has its own stylistic characteristics. Thus, Kinit classification needs to combine scale identification with genre recognition. After describing the dataset, we present the Ethio Kinits Model (EKM), based on VGG, for classifying the EMIR clips. In Experiment 1, we investigated whether Filterbank, Mel-spectrogram, Chroma, or Mel-frequency Cepstral coefficient (MFCC) features work best for Kinit classification using EKM. MFCC was…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Diverse Musicological Studies · Music Technology and Sound Studies
MethodsMax Pooling · Dropout · Sigmoid Activation · Tanh Activation · Dense Connections · Convolution · Softmax · Long Short-Term Memory
