BERP: A Blind Estimator of Room Parameters for Single-Channel Noisy Speech Signals
Lijun Wang, Yixian Lu, Ziyan Gao, Kai Li, Jianqiang Huang, Yuntao Kong, Shogo Okada

TL;DR
BERP is a novel deep learning framework that accurately estimates room acoustical parameters, geometrical parameters, and occupancy levels from single-channel noisy speech, outperforming existing methods in real-world scenarios.
Contribution
The paper introduces BERP, a unified neural network model combining attention and convolutional layers for robust, multitask estimation of multiple room parameters from noisy speech signals.
Findings
BERP achieves state-of-the-art accuracy in estimating room parameters.
The model demonstrates high robustness in real-world noisy environments.
Separate predictors improve per-task estimation performance.
Abstract
Room acoustical parameters (RAPs), room geometrical parameters (RGPs) and instantaneous occupancy level are essential metrics for parameterizing the room acoustical characteristics (RACs) of a sound field around a listener's local environment, offering comprehensive indications for various applications. Current blind estimation methods either fail to cover a broad range of real-world acoustic environments in the context of real background noise or estimate only a few RAPs and RGPs from noisy single-channel speech signals. In addition, they are limited in their ability to estimate the instantaneous occupancy level. In this paper, we propose a new universal blind estimation framework called the blind estimator of room parameters (BERP) to estimate RAPs, RGPs and occupancy level via a unified methodology. It consists of two modules: a unified room feature encoder that combines attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Advanced Adaptive Filtering Techniques
