A dataset and classification model for Malay, Hindi, Tamil and Chinese   music

Fajilatun Nahar; Kat Agres; Balamurali BT; Dorien Herremans

arXiv:2009.04459·cs.SD·September 16, 2020·1 cites

A dataset and classification model for Malay, Hindi, Tamil and Chinese music

Fajilatun Nahar, Kat Agres, Balamurali BT, Dorien Herremans

PDF

Open Access

TL;DR

This paper introduces a new dataset of music from Singapore's main ethnic groups and develops classification models to identify the origin of music based on various musical features.

Contribution

The paper provides a novel dataset and explores multiple feature-based classification models for ethnic music origin identification.

Findings

01

High accuracy in classifying music by ethnic origin

02

Effective use of both high-level and low-level musical features

03

Improved model performance through feature optimization

Abstract

In this paper we present a new dataset, with musical excepts from the three main ethnic groups in Singapore: Chinese, Malay and Indian (both Hindi and Tamil). We use this new dataset to train different classification models to distinguish the origin of the music in terms of these ethnic groups. The classification models were optimized by exploring the use of different musical features as the input. Both high level features, i.e., musically meaningful features, as well as low level features, i.e., spectrogram based features, were extracted from the audio files so as to optimize the performance of the different classification models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies