Interpolation Filter Design for Sample Rate Independent Audio Effect RNNs
Alistair Carson, Alec Wright, Stefan Bilbao

TL;DR
This paper explores interpolation filter design to enable sample rate adjustments in RNN-based audio effects, allowing for flexible real-time audio processing with minimal artifacts.
Contribution
It introduces methods for designing extrapolation filters to perform sample rate conversion in RNNs, analyzing their impact on audio quality and stability.
Findings
Proper filter choice yields high-quality sample rate conversion
Sample rate adjustments can introduce artifacts and instability
Linearised stability analysis predicts suitable filters
Abstract
Recurrent neural networks (RNNs) are effective at emulating the non-linear, stateful behavior of analog guitar amplifiers and distortion effects. Unlike the case of direct circuit simulation, RNNs have a fixed sample rate encoded in their model weights, making the sample rate non-adjustable during inference. Recent work has proposed increasing the sample rate of RNNs at inference (oversampling) by increasing the feedback delay length in samples, using a fractional delay filter for non-integer conversions. Here, we investigate the task of lowering the sample rate at inference (undersampling), and propose using an extrapolation filter to approximate the required fractional signal advance. We consider two filter design methods and analyse the impact of filter order on audio quality. Our results show that the correct choice of filter can give high quality results for both oversampling and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Neural Networks and Applications
