Differentiable Modelling of Percussive Audio with Transient and Spectral Synthesis

Jordie Shier; Franco Caspe; Andrew Robertson; Mark Sandler; Charalampos Saitis; Andrew McPherson

arXiv:2309.06649·cs.SD·September 14, 2023·2 cites

Open Access · 1 Repo

TL;DR

This paper introduces a differentiable synthesis framework for percussive sounds that explicitly models transients. It pairs sinusoidal modelling with jointly trained noise and transient encoders to improve the reconstruction of drum sounds.

Contribution

The paper presents a DDSP-based model for percussive audio synthesis that combines sinusoidal modelling synthesis with transient generation by a modulated temporal convolutional network.

Findings

01

Improved onset signal reconstruction for membranophone percussion.

02

Effective joint training of noise, transient, and sinusoidal encoders.

03

Enhanced interpretability of percussive sound synthesis.

Abstract

Differentiable digital signal processing (DDSP) techniques, including methods for audio synthesis, have gained attention in recent years and lend themselves to interpretability in the parameter space. However, current differentiable synthesis methods have not explicitly sought to model the transient portion of signals, which is important for percussive sounds. In this work, we present a unified synthesis framework aiming to address transient generation and percussive synthesis within a DDSP framework. To this end, we propose a model for percussive synthesis that builds on sinusoidal modeling synthesis and incorporates a modulated temporal convolutional network for transient generation. We use a modified sinusoidal peak picking algorithm to generate time-varying non-harmonic sinusoids and pair it with differentiable noise and transient encoders that are jointly trained to reconstruct…
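
As a rough illustration of the sinusoidal component the abstract describes (a sum of time-varying, non-harmonic sinusoids), the NumPy sketch below synthesises audio from per-sample frequency and amplitude tracks. It is a minimal sketch and not the authors' implementation: the function name synthesize_sinusoids, the sample rate, and the example partial frequencies and decay times are all assumptions chosen for illustration. The peak picking, the noise and transient encoders, and the training procedure are not shown.

```python
import numpy as np


def synthesize_sinusoids(freqs_hz, amps, sample_rate):
    """Sum sinusoids with per-sample frequency and amplitude tracks.

    freqs_hz, amps: arrays of shape (num_sinusoids, num_samples).
    Hypothetical helper for illustration only.
    """
    freqs_hz = np.asarray(freqs_hz, dtype=float)
    amps = np.asarray(amps, dtype=float)
    # Integrate instantaneous frequency over time to obtain each phase track.
    phase = 2.0 * np.pi * np.cumsum(freqs_hz, axis=-1) / sample_rate
    # Weight each sinusoid by its amplitude envelope and mix them down.
    return np.sum(amps * np.sin(phase), axis=0)


# Assumed example values: half a second of audio at 16 kHz.
sample_rate = 16000
num_samples = 8000
t = np.arange(num_samples) / sample_rate

# Two partials that are not integer multiples of a common fundamental.
# The first glides downward in pitch shortly after the onset.
freqs = np.stack([
    180.0 + 60.0 * np.exp(-t / 0.02),
    np.full(num_samples, 287.0),
])
# Exponentially decaying amplitude envelopes.
amps = np.stack([
    np.exp(-t / 0.15),
    0.5 * np.exp(-t / 0.08),
])

audio = synthesize_sinusoids(freqs, amps, sample_rate)
```

Because every operation here is a plain array operation, the same structure carries over to an automatic differentiation framework such as PyTorch, which is what makes a synthesiser of this kind usable inside a DDSP-style training loop.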

Code & Models

Repositories

jorshi/drumblender
PyTorch · Official

Taxonomy

Topics: Music Technology and Sound Studies · Music and Audio Processing · Speech and Audio Processing