Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance?
Atsuo Hiroe, Katsutoshi Itoyama, Kazuhiro Nakadai

TL;DR
This paper presents a unified framework for mask-based beamformers that encompasses all variations, demonstrating that all can achieve optimal target extraction performance and introducing mask-based scaling for realistic scenarios.
Contribution
The study develops a comprehensive framework covering all mask-based beamformer variations, including new ones, and shows they can all reach theoretical upper-bound performance.
Findings
All 12 BF variations achieve the theoretical upper-bound performance.
Mask-based scaling can mimic ideal scaling even in realistic conditions.
The framework aids in designing TSE systems and estimating BF performance.
Abstract
This study investigates mask-based beamformers (BFs), which estimate filters for target sound extraction (TSE) using time-frequency masks. Although multiple mask-based BFs have been proposed, no consensus has been reached on which one offers the best target-extraction performance. Previously, we found that maximum signal-to-noise ratio and minimum mean square error (MSE) BFs can achieve the same extraction performance as the theoretical upper-bound performance, with each BF containing a different optimal mask. However, two issues remained unsolved: only two BFs were covered, excluding the minimum variance distortionless response BF, and ideal scaling (IS) was employed to ideally adjust the output scale, which is not applicable to realistic scenarios. To address these issues, this study proposes a unified framework for mask-based BFs comprising two processes: filter estimation that can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhotoacoustic and Ultrasonic Imaging · Advancements in Photolithography Techniques · Laser Design and Applications
