Sample Rate Independent Recurrent Neural Networks for Audio Effects Processing

Alistair Carson; Alec Wright; Jatin Chowdhury; Vesa Välimäki; Stefan Bilbao

arXiv:2406.06293·eess.AS·June 11, 2024·1 cites

TL;DR

This paper explores methods to modify recurrent neural networks for audio effects to operate reliably across different sample rates, introducing novel techniques for sample rate independence and demonstrating their effectiveness.

Contribution

It proposes new methods for making RNN-based audio models sample rate independent, including delay-based and interpolation techniques, with comprehensive evaluation.

Findings

01 For integer oversampling, a previously proposed delay-based approach achieves high-fidelity sample rate conversion and also reduces aliasing.

02 Cubic Lagrange interpolation significantly improves non-integer sample rate adjustment.

03 First in-depth study of sample rate independence in RNN audio models.

Abstract

In recent years, machine learning approaches to modelling guitar amplifiers and effects pedals have been widely investigated and have become standard practice in some consumer products. In particular, recurrent neural networks (RNNs) are a popular choice for modelling non-linear devices such as vacuum tube amplifiers and distortion circuitry. One limitation of such models is that they are trained on audio at a specific sample rate and therefore give unreliable results when operating at another rate. Here, we investigate several methods of modifying RNN structures to make them approximately sample rate independent, with a focus on oversampling. In the case of integer oversampling, we demonstrate that a previously proposed delay-based approach provides high fidelity sample rate conversion whilst additionally reducing aliasing. For non-integer sample rate adjustment, we propose two novel…
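
The delay-based idea and the Lagrange interpolation mentioned above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a toy scalar tanh RNN in place of a trained model, and the names `lagrange_coeffs`, `run_rnn`, `ratio`, `w_x`, `w_h`, and `b` are hypothetical. The assumption is that a network trained at one rate and run at `ratio` times that rate feeds back its state from `ratio` samples ago instead of one, with a cubic Lagrange filter standing in for the delay when `ratio` is not an integer.

```python
import numpy as np


def lagrange_coeffs(delay, order=3):
    """FIR taps of a Lagrange fractional-delay filter.

    taps[k] = prod over m != k of (delay - m) / (k - m), for k = 0..order.
    """
    k = np.arange(order + 1)
    coeffs = np.ones(order + 1)
    for m in range(order + 1):
        mask = k != m
        coeffs[mask] *= (delay - m) / (k[mask] - m)
    return coeffs


def run_rnn(x, w_x, w_h, b, ratio=1.0, order=3):
    """Toy scalar RNN (illustrative only) with state feedback delayed by `ratio` samples.

    ratio = operating sample rate / training sample rate (assumed definition).
    """
    if not 0.0 <= ratio - 1.0 <= order:
        raise ValueError("ratio must lie in [1, order + 1] for this sketch")
    # hist[k] holds h[n-1-k], so a total delay of `ratio` samples is a
    # delay of (ratio - 1) samples measured from the newest stored state.
    taps = lagrange_coeffs(ratio - 1.0, order)
    hist = np.zeros(order + 1)
    y = np.empty(len(x))
    for n, xn in enumerate(x):
        h_delayed = taps @ hist            # approximates h[n - ratio]
        h = np.tanh(w_x * xn + w_h * h_delayed + b)
        hist = np.roll(hist, 1)            # shift the state history by one sample
        hist[0] = h
        y[n] = h
    return y
```

With `ratio=1.0` the taps reduce to a unit delay and the function behaves like an ordinary RNN. With `ratio=2.0` and an input in which every sample is repeated twice, every second output sample matches the base-rate output exactly, which is the sense in which an integer state delay preserves the trained behaviour. For non-integer ratios the interpolated state is only an approximation, and its accuracy depends on where the fractional delay falls relative to the filter taps.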

Taxonomy

Topics: Music and Audio Processing · Speech and Audio Processing · Neural Networks and Applications
