Differentiable Artificial Reverberation
Sungho Lee, Hyeong-Seok Choi, and Kyogu Lee

TL;DR
This paper introduces differentiable artificial reverberation models that enable end-to-end training for more accurate AR parameter estimation, overcoming computational bottlenecks by using FIR approximations of IIR filters.
Contribution
The paper proposes FIR-based approximations of IIR filters in DAR models, enabling efficient end-to-end training for AR parameter estimation in reverberation modeling.
Findings
End-to-end trained DAR models outperform non-end-to-end approaches.
FIR approximations significantly improve training speed on GPU.
Subjective listening tests favor the proposed method.
Abstract
Artificial reverberation (AR) models play a central role in various audio applications. Therefore, estimating the AR model parameters (ARPs) of a reference reverberation is a crucial task. Although a few recent deep-learning-based approaches have shown promising performance, their non-end-to-end training scheme prevents them from fully exploiting the potential of deep neural networks. This motivates the introduction of differentiable artificial reverberation (DAR) models, allowing loss gradients to be back-propagated end-to-end. However, implementing the AR models with their difference equations "as is" in the deep learning framework severely bottlenecks the training speed when executed with a parallel processor like GPU due to their infinite impulse response (IIR) components. We tackle this problem by replacing the IIR filters with finite impulse response (FIR) approximations with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Hearing Loss and Rehabilitation · Acoustic Wave Phenomena Research
