Multi-Step Chord Sequence Prediction Based on Aggregated Multi-Scale Encoder-Decoder Network
Tristan Carsault, Andrew McLeod, Philippe Esling, J\'er\^ome Nika,, Eita Nakamura, Kazuyoshi Yoshii

TL;DR
This paper introduces a multi-scale encoder-decoder architecture for multi-step jazz chord sequence prediction, leveraging iterative temporal aggregation to improve accuracy and musical relevance over existing models.
Contribution
It proposes a novel multi-scale neural network approach that effectively captures musical structures for improved multi-step chord prediction.
Findings
Outperforms state-of-the-art models in accuracy and perplexity.
Requires fewer parameters than comparable models.
Shows the influence of downbeat position on prediction accuracy.
Abstract
This paper studies the prediction of chord progressions for jazz music by relying on machine learning models. The motivation of our study comes from the recent success of neural networks for performing automatic music composition. Although high accuracies are obtained in single-step prediction scenarios, most models fail to generate accurate multi-step chord predictions. In this paper, we postulate that this comes from the multi-scale structure of musical information and propose new architectures based on an iterative temporal aggregation of input labels. Specifically, the input and ground truth labels are merged into increasingly large temporal bags, on which we train a family of encoder-decoder networks for each temporal scale. In a second step, we use these pre-trained encoder bottleneck features at each scale in order to train a final encoder-decoder network. Furthermore, we rely on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Neuroscience and Music Perception
