SYMPLEX: Controllable Symbolic Music Generation using Simplex Diffusion   with Vocabulary Priors

Nicolas Jonason; Luca Casini; Bob L.T. Sturm

arXiv:2405.12666·cs.SD·May 22, 2024

SYMPLEX: Controllable Symbolic Music Generation using Simplex Diffusion with Vocabulary Priors

Nicolas Jonason, Luca Casini, Bob L.T. Sturm

PDF

Open Access

TL;DR

This paper introduces SYMPLEX, a novel diffusion-based model for fast, controllable symbolic music generation that leverages vocabulary priors to enable detailed manipulation of musical features without additional training.

Contribution

The paper presents a new simplex diffusion approach for symbolic music generation that allows for direct control over musical attributes through vocabulary priors, without task-specific adaptation.

Findings

01

Effective control over music features via vocabulary priors

02

Fast generation of 4-bar multi-instrument loops

03

No need for extrinsic control or model retraining

Abstract

We present a new approach for fast and controllable generation of symbolic music based on the simplex diffusion, which is essentially a diffusion process operating on probabilities rather than the signal space. This objective has been applied in domains such as natural language processing but here we apply it to generating 4-bar multi-instrument music loops using an orderless representation. We show that our model can be steered with vocabulary priors, which affords a considerable level control over the music generation process, for instance, infilling in time and pitch and choice of instrumentation -- all without task-specific model adaptation or applying extrinsic control.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Neuroscience and Music Perception

MethodsDiffusion