DrumGAN VST: A Plugin for Drum Sound Analysis/Synthesis With   Autoencoding Generative Adversarial Networks

Javier Nistal; Cyran Aouameur; Ithan Velarde; and Stefan Lattner

arXiv:2206.14723·cs.SD·June 30, 2022

DrumGAN VST: A Plugin for Drum Sound Analysis/Synthesis With Autoencoding Generative Adversarial Networks

Javier Nistal, Cyran Aouameur, Ithan Velarde, and Stefan Lattner

PDF

Open Access

TL;DR

DrumGAN VST is a deep learning-based plugin that synthesizes and manipulates drum sounds by encoding and generating audio through a GAN, simplifying sound design in music production.

Contribution

The paper introduces DrumGAN VST, a novel plugin that uses a GAN with an encoding network for high-level control and resynthesis of drum sounds in real-time.

Findings

01

Operates at 44.1 kHz sample rate

02

Provides independent, continuous control over instrument classes

03

Enables resynthesis and manipulation of existing sounds

Abstract

In contemporary popular music production, drum sound design is commonly performed by cumbersome browsing and processing of pre-recorded samples in sound libraries. One can also use specialized synthesis hardware, typically controlled through low-level, musically meaningless parameters. Today, the field of Deep Learning offers methods to control the synthesis process via learned high-level features and allows generating a wide variety of sounds. In this paper, we present DrumGAN VST, a plugin for synthesizing drum sounds using a Generative Adversarial Network. DrumGAN VST operates on 44.1 kHz sample-rate audio, offers independent and continuous instrument class controls, and features an encoding neural network that maps sounds into the GAN's latent space, enabling resynthesis and manipulation of pre-existing drum sounds. We provide numerous sound examples and a demo of the proposed VST…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing