A Composite T60 Regression and Classification Approach for Speech   Dereverberation

Yuying Li; Yuchen Liu; Donald S.Williamson

arXiv:2302.04932·eess.AS·February 13, 2023

A Composite T60 Regression and Classification Approach for Speech Dereverberation

Yuying Li, Yuchen Liu, Donald S.Williamson

PDF

Open Access

TL;DR

This paper introduces a joint learning framework that combines T60 reverberation time estimation with dereverberation to improve speech quality in reverberant environments, leveraging acoustic environment information.

Contribution

It presents a novel composite approach that jointly estimates T60 and performs dereverberation, enhancing performance over existing methods.

Findings

01

Improved dereverberation performance in simulated environments.

02

Enhanced speech quality in real-world reverberant settings.

03

Effective integration of T60 estimation with dereverberation.

Abstract

Dereverberation is often performed directly on the reverberant audio signal, without knowledge of the acoustic environment. Reverberation time, T60, however, is an essential acoustic factor that reflects how reverberation may impact a signal. In this work, we propose to perform dereverberation while leveraging key acoustic information from the environment. More specifically, we develop a joint learning approach that uses a composite T60 module and a separate dereverberation module to simultaneously perform reverberation time estimation and dereverberation. The reverberation time module provides key features to the dereverberation module during fine tuning. We evaluate our approach in simulated and real environments, and compare against several approaches. The results show that this composite framework improves performance in environments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Hearing Loss and Rehabilitation