A Large-Scale Study of Language Models for Chord Prediction

Filip Korzeniowski; David R. W. Sears; Gerhard Widmer

arXiv:1804.01849·cs.LG·April 6, 2018

A Large-Scale Study of Language Models for Chord Prediction

Filip Korzeniowski, David R. W. Sears, Gerhard Widmer

PDF

TL;DR

This study compares N-gram and recurrent neural network models for chord prediction across extensive datasets, revealing that RNNs can adapt to individual songs, enhancing local musical context understanding.

Contribution

It provides a comprehensive comparison of language models for chord prediction and demonstrates RNNs' ability to adapt to specific songs, advancing context-aware chord recognition.

Findings

01

RNNs outperform N-gram models in chord prediction.

02

Certain RNN configurations adapt to individual songs at test time.

03

The study offers insights into hyper-parameter tuning for RNNs.

Abstract

We conduct a large-scale study of language models for chord prediction. Specifically, we compare N-gram models to various flavours of recurrent neural networks on a comprehensive dataset comprising all publicly available datasets of annotated chords known to us. This large amount of data allows us to systematically explore hyper-parameter settings for the recurrent neural networks---a crucial step in achieving good results with this model class. Our results show not only a quantitative difference between the models, but also a qualitative one: in contrast to static N-gram models, certain RNN configurations adapt to the songs at test time. This finding constitutes a further step towards the development of chord recognition systems that are more aware of local musical context than what was previously possible.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.