Flexible Control in Symbolic Music Generation via Musical Metadata

Sangjun Han; Jiwon Ham; Chaeeun Lee; Heejin Kim; Soojong Do; Sihyuk; Yi; Jun Seo; Seoyoon Kim; Yountae Jung; Woohyung Lim

arXiv:2409.07467·cs.SD·September 13, 2024

Flexible Control in Symbolic Music Generation via Musical Metadata

Sangjun Han, Jiwon Ham, Chaeeun Lee, Heejin Kim, Soojong Do, Sihyuk, Yi, Jun Seo, Seoyoon Kim, Yountae Jung, Woohyung Lim

PDF

Open Access

TL;DR

This paper presents a flexible symbolic music generation method using an autoregressive model that incorporates musical metadata, allowing for controllable and high-quality music synthesis with demonstrated effectiveness and user control.

Contribution

Introduces a novel autoregressive model that enables flexible control in symbolic music generation by using randomly dropped musical metadata tokens during training.

Findings

01

Model achieves high musical fidelity and diversity.

02

Enhanced controllability over music generation.

03

Outperforms other models in subjective quality tests.

Abstract

In this work, we introduce the demonstration of symbolic music generation, focusing on providing short musical motifs that serve as the central theme of the narrative. For the generation, we adopt an autoregressive model which takes musical metadata as inputs and generates 4 bars of multitrack MIDI sequences. During training, we randomly drop tokens from the musical metadata to guarantee flexible control. It provides users with the freedom to select input types while maintaining generative performance, enabling greater flexibility in music composition. We validate the effectiveness of the strategy through experiments in terms of model capacity, musical fidelity, diversity, and controllability. Additionally, we scale up the model and compare it with other music generation model through a subjective test. Our results indicate its superiority in both control and music quality. We provide a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies