Synthetic Wave-Geometric Impulse Responses for Improved Speech   Dereverberation

Rohith Aralikatti; Zhenyu Tang; Dinesh Manocha

arXiv:2212.05360·eess.AS·December 13, 2022

Synthetic Wave-Geometric Impulse Responses for Improved Speech Dereverberation

Rohith Aralikatti, Zhenyu Tang, Dinesh Manocha

PDF

Open Access

TL;DR

This paper introduces a hybrid synthetic dataset for speech dereverberation, combining wave-based and geometric methods to better simulate room acoustics, leading to improved dereverberation performance.

Contribution

The paper proposes a novel hybrid synthetic RIR dataset that enhances speech dereverberation models by accurately simulating low-frequency components.

Findings

01

Models trained on hybrid synthetic RIRs outperform those trained on purely geometric RIRs.

02

Accurate low-frequency simulation is crucial for effective dereverberation.

03

Hybrid dataset improves performance across multiple real-world RIR datasets.

Abstract

We present a novel approach to improve the performance of learning-based speech dereverberation using accurate synthetic datasets. Our approach is designed to recover the reverb-free signal from a reverberant speech signal. We show that accurately simulating the low-frequency components of Room Impulse Responses (RIRs) is important to achieving good dereverberation. We use the GWA dataset that consists of synthetic RIRs generated in a hybrid fashion: an accurate wave-based solver is used to simulate the lower frequencies and geometric ray tracing methods simulate the higher frequencies. We demonstrate that speech dereverberation models trained on hybrid synthetic RIRs outperform models trained on RIRs generated by prior geometric ray tracing methods on four real-world RIR datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Indoor and Outdoor Localization Technologies · Advanced Adaptive Filtering Techniques