Coupled Recurrent Models for Polyphonic Music Composition

John Thickstun; Zaid Harchaoui; Dean P. Foster; Sham M. Kakade

arXiv:1811.08045·cs.SD·November 28, 2019·6 cites

Coupled Recurrent Models for Polyphonic Music Composition

John Thickstun, Zaid Harchaoui, Dean P. Foster, Sham M. Kakade

PDF

Open Access

TL;DR

This paper presents a new recurrent neural model for polyphonic music composition that captures the structure of concurrent voices using a probabilistic factorization, combining convolutional and recurrent ideas.

Contribution

It introduces a novel conditional probabilistic factorization tailored for polyphonic music, integrating convolutional and recurrent neural networks for improved modeling.

Findings

01

Effective modeling of polyphonic scores with coupled recurrent models

02

Trained on 2,300 scores from the KernScores dataset

03

Demonstrates improved structure capturing in music generation

Abstract

This paper introduces a novel recurrent model for music composition that is tailored to the structure of polyphonic music. We propose an efficient new conditional probabilistic factorization of musical scores, viewing a score as a collection of concurrent, coupled sequences: i.e. voices. To model the conditional distributions, we borrow ideas from both convolutional and recurrent neural models; we argue that these ideas are natural for capturing music's pitch invariances, temporal structure, and polyphony. We train models for single-voice and multi-voice composition on 2,300 scores from the KernScores dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Neuroscience and Music Perception