Conditional Drums Generation using Compound Word Representations

Dimos Makris; Guo Zixun; Maximos Kaliakatsos-Papakostas; Dorien; Herremans

arXiv:2202.04464·cs.SD·February 22, 2022

Conditional Drums Generation using Compound Word Representations

Dimos Makris, Guo Zixun, Maximos Kaliakatsos-Papakostas, Dorien, Herremans

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel data encoding scheme inspired by Compound Word representations for conditional drums generation, utilizing a sequence-to-sequence model with BiLSTM and Transformer components to produce realistic, coherent drum sequences conditioned on musical context.

Contribution

The paper presents a new compound word-inspired encoding scheme and a sequence-to-sequence architecture for improved conditional drums generation in music composition.

Findings

01

Generated drums match training data statistical features

02

Sequences are perceived as pleasant and natural in listening tests

03

Model outperforms several baseline methods

Abstract

The field of automatic music composition has seen great progress in recent years, specifically with the invention of transformer-based architectures. When using any deep learning model which considers music as a sequence of events with multiple complex dependencies, the selection of a proper data representation is crucial. In this paper, we tackle the task of conditional drums generation using a novel data encoding scheme inspired by the Compound Word representation, a tokenization process of sequential data. Therefore, we present a sequence-to-sequence architecture where a Bidirectional Long short-term memory (BiLSTM) Encoder receives information about the conditioning parameters (i.e., accompanying tracks and musical attributes), while a Transformer-based Decoder with relative global attention produces the generated drum sequences. We conducted experiments to thoroughly compare the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

melkor169/cp_drums_generation
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech Recognition and Synthesis