Music Genre Classification using Masked Conditional Neural Networks

Fady Medhat; David Chesmore; John Robinson

arXiv:1802.06432·cs.LG·April 12, 2019

Music Genre Classification using Masked Conditional Neural Networks

Fady Medhat, David Chesmore, John Robinson

PDF

1 Repo

TL;DR

This paper introduces the Masked Conditional Neural Network (MCLNN), a model designed to improve music genre classification by capturing time-frequency representations and automating feature exploration, achieving competitive accuracy.

Contribution

The paper presents MCLNN, a novel neural network architecture that enforces systematic sparsity and automates feature selection for music genre classification.

Findings

01

MCLNN achieves competitive accuracy with state-of-the-art methods.

02

The mask enforces frequency band learning, improving robustness to shifts.

03

Automated feature exploration reduces manual tuning effort.

Abstract

The ConditionaL Neural Networks (CLNN) and the Masked ConditionaL Neural Networks (MCLNN) exploit the nature of multi-dimensional temporal signals. The CLNN captures the conditional temporal influence between the frames in a window and the mask in the MCLNN enforces a systematic sparseness that follows a filterbank-like pattern over the network links. The mask induces the network to learn about time-frequency representations in bands, allowing the network to sustain frequency shifts. Additionally, the mask in the MCLNN automates the exploration of a range of feature combinations, usually done through an exhaustive manual search. We have evaluated the MCLNN performance using the Ballroom and Homburg datasets of music genres. MCLNN has achieved accuracies that are competitive to state-of-the-art handcrafted attempts in addition to models based on Convolutional Neural Networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fadymedhat/MCLNN
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.