Similarity Metrics For Late Reverberation
Gloria Dal Santo, Karolina Prawda, Sebastian J. Schlecht, Vesa V\"alim\"aki

TL;DR
This paper introduces two new differentiable similarity metrics tailored for late reverberation in room impulse responses, improving automatic reverberation algorithm tuning within machine learning frameworks.
Contribution
The paper proposes novel reverberation-specific similarity metrics based on power and energy decay, outperforming existing general audio metrics in various room configurations.
Findings
Metrics outperform baseline audio similarity measures
Power-based metric shows the best performance
Metrics are differentiable and suitable for machine learning
Abstract
Automatic tuning of reverberation algorithms relies on the optimization of a cost function. While general audio similarity metrics are useful, they are not optimized for the specific statistical properties of reverberation in rooms. This paper presents two novel metrics for assessing the similarity of late reverberation in room impulse responses. These metrics are differentiable and can be utilized within a machine-learning framework. We compare the performance of these metrics to two popular audio metrics using a large dataset of room impulse responses encompassing various room configurations and microphone positions. The results indicate that the proposed functions based on averaged power and frequency-band energy decay outperform the baselines with the former exhibiting the most suitable profile towards the minimum. The proposed work holds promise as an improvement to the design and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Underwater Acoustics Research · Acoustic Wave Phenomena Research
