BERP: A Blind Estimator of Room Parameters for Single-Channel Noisy Speech Signals

Lijun Wang; Yixian Lu; Ziyan Gao; Kai Li; Jianqiang Huang; Yuntao Kong; Shogo Okada

arXiv:2405.04476·eess.AS·June 2, 2025

BERP: A Blind Estimator of Room Parameters for Single-Channel Noisy Speech Signals

Lijun Wang, Yixian Lu, Ziyan Gao, Kai Li, Jianqiang Huang, Yuntao Kong, Shogo Okada

PDF

Open Access 1 Repo

TL;DR

BERP is a novel deep learning framework that accurately estimates room acoustical parameters, geometrical parameters, and occupancy levels from single-channel noisy speech, outperforming existing methods in real-world scenarios.

Contribution

The paper introduces BERP, a unified neural network model combining attention and convolutional layers for robust, multitask estimation of multiple room parameters from noisy speech signals.

Findings

01

BERP achieves state-of-the-art accuracy in estimating room parameters.

02

The model demonstrates high robustness in real-world noisy environments.

03

Separate predictors improve per-task estimation performance.

Abstract

Room acoustical parameters (RAPs), room geometrical parameters (RGPs) and instantaneous occupancy level are essential metrics for parameterizing the room acoustical characteristics (RACs) of a sound field around a listener's local environment, offering comprehensive indications for various applications. Current blind estimation methods either fail to cover a broad range of real-world acoustic environments in the context of real background noise or estimate only a few RAPs and RGPs from noisy single-channel speech signals. In addition, they are limited in their ability to estimate the instantaneous occupancy level. In this paper, we propose a new universal blind estimation framework called the blind estimator of room parameters (BERP) to estimate RAPs, RGPs and occupancy level via a unified methodology. It consists of two modules: a unified room feature encoder that combines attention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Alizeded/BERP
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Advanced Adaptive Filtering Techniques