CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
Giorgio Barnab\`o, Giovanni Trappolini, Lorenzo Lastilla, Cesare, Campagnano, Angela Fan, Fabio Petroni, Fabrizio Silvestri

TL;DR
CycleDRUMS uses CycleGAN to automatically generate drum tracks from bass lines by translating mel-spectrograms, enabling realistic and musically coherent audio arrangements without paired training data.
Contribution
This work introduces a novel application of CycleGAN for audio-to-audio translation in music, specifically generating drums from bass lines, and compares it with paired methods like Pix2Pix.
Findings
CycleDRUMS produces credible drum tracks that align with bass lines.
Unsupervised CycleGAN outperforms paired Pix2Pix in this task.
Proposed metric partially evaluates the quality of generated music.
Abstract
The two main research threads in computer-based music generation are: the construction of autonomous music-making systems, and the design of computer-based environments to assist musicians. In the symbolic domain, the key problem of automatically arranging a piece music was extensively studied, while relatively fewer systems tackled this challenge in the audio domain. In this contribution, we propose CycleDRUMS, a novel method for generating drums given a bass line. After converting the waveform of the bass into a mel-spectrogram, we are able to automatically generate original drums that follow the beat, sound credible and can be directly mixed with the input bass. We formulated this task as an unpaired image-to-image translation problem, and we addressed it with CycleGAN, a well-established unsupervised style transfer framework, originally designed for treating images. The choice to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies · Music and Audio Processing · Generative Adversarial Networks and Image Synthesis
MethodsBatch Normalization · Instance Normalization · Tanh Activation · Concatenated Skip Connection · Dropout · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · GAN Least Squares Loss · Sigmoid Activation · Convolution
