MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment

Hao-Wen Dong; Wen-Yi Hsiao; Li-Chia Yang; Yi-Hsuan Yang

arXiv:1709.06298 · eess.AS · August 6, 2020 · 151 cites

TL;DR

MuseGAN introduces three GAN-based models (the jamming, composer, and hybrid models) for multi-track symbolic music generation. The models generate coherent four-bar music from scratch and can also accompany a human-supplied track; they are evaluated with objective metrics and user studies.

Contribution

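The paper presents GAN architectures tailored to multi-track symbolic music generation, addressing three difficulties: music unfolds over time, tracks have their own dynamics yet depend on one another, and polyphonic notes do not fall into a natural chronological order.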

Findings

01 Models generate coherent four-bar music from scratch

02 Effective in human-AI collaborative music creation

03 Proposed metrics evaluate intra- and inter-track coherence
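To make the idea of an intra-track metric concrete, here is a minimal sketch of two simple measures computed on a binary piano-roll: the ratio of empty bars and the average number of pitch classes used per bar. The function names, the array layout `(n_bars, n_steps, n_pitches)`, the toy data, and the choice to average over all bars (including empty ones) are assumptions made for illustration; this is not the authors' evaluation code.

```python
import numpy as np


def empty_bar_ratio(track):
    """Fraction of bars that contain no active note.

    track: boolean array of shape (n_bars, n_steps, n_pitches) (assumed layout).
    """
    has_note = track.any(axis=(1, 2))  # one flag per bar
    return 1.0 - has_note.mean()


def used_pitch_classes(track, lowest_pitch=0):
    """Mean number of distinct pitch classes (0 to 12) used per bar.

    lowest_pitch is the pitch number of column 0 (hypothetical parameter).
    Empty bars count as 0, which is a choice made for this sketch.
    """
    n_pitches = track.shape[2]
    pitch_active = track.any(axis=1)  # (n_bars, n_pitches)
    classes = (np.arange(n_pitches) + lowest_pitch) % 12
    counts = [np.unique(classes[bar]).size for bar in pitch_active]
    return float(np.mean(counts))


# Toy single-track example: 4 bars, 96 time steps per bar, 84 pitches.
track = np.zeros((4, 96, 84), dtype=bool)
for pitch in (0, 4, 7):            # bar 0: a triad, three pitch classes
    track[0, :24, pitch] = True
track[1, :48, 0] = True            # bar 1: the same pitch class an octave apart
track[1, 48:, 12] = True
track[3, :, 5] = True              # bar 3: one sustained note; bar 2 stays empty

eb = empty_bar_ratio(track)
upc = used_pitch_classes(track)
print(eb, upc)
```

An inter-track measure would compare two such arrays, for example by checking how harmonically close a pair of tracks is within each bar.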

Abstract

Generating music has a few notable differences from generating images and videos. First, music is an art of time, necessitating a temporal model. Second, music is usually composed of multiple instruments/tracks with their own temporal dynamics, but collectively they unfold over time interdependently. Lastly, musical notes are often grouped into chords, arpeggios or melodies in polyphonic music, so imposing a chronological ordering on notes is not naturally suitable. In this paper, we propose three models for symbolic multi-track music generation under the framework of generative adversarial networks (GANs). The three models, which differ in the underlying assumptions and accordingly the network architectures, are referred to as the jamming model, the composer model and the hybrid model. We trained the proposed models on a dataset of over one hundred thousand bars of rock…
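The truncated abstract above names the three models without saying how they differ. The sketch below shows one way to picture the difference in terms of which noise vectors each track generator sees: private noise per track (jamming), one shared noise vector feeding a single generator (composer), or shared plus private noise (hybrid). This wiring follows the description in the full paper as best understood here and should be read as an assumption; the track count, the latent and output sizes, and the toy linear "generators" are all hypothetical stand-ins, and the discriminators are omitted.

```python
import numpy as np

N_TRACKS = 5   # assumed number of instrument tracks
Z_DIM = 8      # hypothetical latent size
OUT_DIM = 16   # hypothetical flattened piano-roll size per track

rng = np.random.default_rng(0)


def make_generator(in_dim, out_dim):
    """Toy stand-in for a generator network: fixed random linear map + sigmoid."""
    weights = rng.normal(size=(in_dim, out_dim))
    return lambda z: 1.0 / (1.0 + np.exp(-(z @ weights)))


# Jamming: each track has its own generator and its own private noise vector.
jam_gens = [make_generator(Z_DIM, OUT_DIM) for _ in range(N_TRACKS)]
jam_in = [rng.normal(size=Z_DIM) for _ in range(N_TRACKS)]
jam_out = np.stack([g(z) for g, z in zip(jam_gens, jam_in)])

# Composer: a single generator maps one shared noise vector to all tracks.
composer_gen = make_generator(Z_DIM, N_TRACKS * OUT_DIM)
composer_out = composer_gen(rng.normal(size=Z_DIM)).reshape(N_TRACKS, OUT_DIM)

# Hybrid: per-track generators, each fed shared noise plus private noise.
shared_z = rng.normal(size=Z_DIM)
hyb_in = [np.concatenate([shared_z, rng.normal(size=Z_DIM)])
          for _ in range(N_TRACKS)]
hyb_gens = [make_generator(2 * Z_DIM, OUT_DIM) for _ in range(N_TRACKS)]
hyb_out = np.stack([g(z) for g, z in zip(hyb_gens, hyb_in)])

print(jam_out.shape, composer_out.shape, hyb_out.shape)
```

In this picture the shared vector is what lets tracks coordinate, while the private vectors let each track vary on its own.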

Taxonomy

Topics: Music Technology and Sound Studies · Music and Audio Processing · Generative Adversarial Networks and Image Synthesis