A Fully Convolutional Deep Auditory Model for Musical Chord Recognition

Filip Korzeniowski; Gerhard Widmer

arXiv:1612.05082·cs.LG·December 16, 2016

A Fully Convolutional Deep Auditory Model for Musical Chord Recognition

Filip Korzeniowski, Gerhard Widmer

PDF

TL;DR

This paper introduces a fully convolutional deep auditory model for musical chord recognition that automatically learns features and achieves competitive or superior results compared to existing methods.

Contribution

The paper presents a novel end-to-end deep learning approach combining a convolutional auditory model with a CRF for improved chord recognition without manual feature engineering.

Findings

01

Learned features are musically interpretable

02

Achieves state-of-the-art or better accuracy

03

Automatically trains feature extraction and decoding stages

Abstract

Chord recognition systems depend on robust feature extraction pipelines. While these pipelines are traditionally hand-crafted, recent advances in end-to-end machine learning have begun to inspire researchers to explore data-driven methods for such tasks. In this paper, we present a chord recognition system that uses a fully convolutional deep auditory model for feature extraction. The extracted features are processed by a Conditional Random Field that decodes the final chord sequence. Both processing stages are trained automatically and do not require expert knowledge for optimising parameters. We show that the learned auditory system extracts musically interpretable features, and that the proposed chord recognition system achieves results on par or better than state-of-the-art algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.