Learning a Latent Space of Multitrack Measures
Ian Simon, Adam Roberts, Colin Raffel, Jesse Engel, Curtis Hawthorne,, Douglas Eck

TL;DR
This paper extends MusicVAE to create a latent space for multitrack measures, enabling meaningful music generation, interpolation, and attribute manipulation with chord conditioning for harmony control.
Contribution
It introduces a novel latent space representation for multitrack music with chord conditioning, allowing flexible and controlled music generation and manipulation.
Findings
Enables generation of plausible multitrack measures from scratch
Allows interpolation between musical measures in a meaningful way
Supports manipulation of musical attributes while maintaining harmony
Abstract
Discovering and exploring the underlying structure of multi-instrumental music using learning-based approaches remains an open problem. We extend the recent MusicVAE model to represent multitrack polyphonic measures as vectors in a latent space. Our approach enables several useful operations such as generating plausible measures from scratch, interpolating between measures in a musically meaningful way, and manipulating specific musical attributes. We also introduce chord conditioning, which allows all of these operations to be performed while keeping harmony fixed, and allows chords to be changed while maintaining musical "style". By generating a sequence of measures over a predefined chord progression, our model can produce music with convincing long-term structure. We demonstrate that our latent space model makes it possible to intuitively control and generate musical sequences with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Neuroscience and Music Perception
