Feature Learning for Chord Recognition: The Deep Chroma Extractor

Filip Korzeniowski; Gerhard Widmer

arXiv:1612.05065·cs.SD·December 16, 2016·59 cites

Feature Learning for Chord Recognition: The Deep Chroma Extractor

Filip Korzeniowski, Gerhard Widmer

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neural network-based learned chroma feature extractor that improves chord recognition accuracy by producing more robust harmonic features compared to traditional methods.

Contribution

It proposes a novel neural network approach to learn chroma features directly from audio spectra, enhancing robustness and harmonic information encoding for chord recognition.

Findings

01

Learned chroma features outperform hand-crafted ones in chord recognition.

02

The neural network effectively reduces noise and resolves harmonic ambiguities.

03

The approach improves accuracy across various datasets.

Abstract

We explore frame-level audio feature learning for chord recognition using artificial neural networks. We present the argument that chroma vectors potentially hold enough information to model harmonic content of audio for chord recognition, but that standard chroma extractors compute too noisy features. This leads us to propose a learned chroma feature extractor based on artificial neural networks. It is trained to compute chroma features that encode harmonic information important for chord recognition, while being robust to irrelevant interferences. We achieve this by feeding the network an audio spectrum with context instead of a single frame as input. This way, the network can learn to selectively compensate noise and resolve harmonic ambiguities. We compare the resulting features to hand-crafted ones by using a simple linear frame-wise classifier for chord recognition on various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CPJKU/madmom
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies