HyBeam: Hybrid Microphone-Beamforming Array-Agnostic Speech Enhancement for Wearables
Yuval Bar Ilan (1), Boaz Rafaely (1), Vladimir Tourbabin (2) ((1) School of Electrical, Computer Engineering, Ben-Gurion University of the Negev, Beer-Sheva, Israel (2) Reality Labs Research, Meta, Redmond, WA, USA)

TL;DR
HyBeam is a hybrid speech enhancement framework that combines raw microphone signals and beamformer outputs to achieve robust, array-agnostic performance across diverse acoustic environments and wearable device configurations.
Contribution
It introduces a novel hybrid approach that leverages the strengths of both raw microphone signals and beamformer outputs for array-agnostic speech enhancement.
Findings
HyBeam outperforms baseline methods in PESQ, STOI, and SI-SDR.
Bandwise analysis shows effective use of beamformer at high frequencies and microphone cues at low frequencies.
Demonstrates robustness across diverse room and wearable array configurations.
Abstract
Speech enhancement is a fundamental challenge in signal processing, particularly when robustness is required across diverse acoustic conditions and microphone setups. Deep learning methods have been successful for speech enhancement, but often assume fixed array geometries, limiting their use in mobile, embedded, and wearable devices. Existing array-agnostic approaches typically rely on either raw microphone signals or beamformer outputs, but both have drawbacks under changing geometries. We introduce HyBeam, a hybrid framework that uses raw microphone signals at low frequencies and beamformer signals at higher frequencies, exploiting their complementary strengths while remaining highly array-agnostic. Simulations across diverse rooms and wearable array configurations demonstrate that HyBeam consistently surpasses microphone-only and beamformer-only baselines in PESQ, STOI, and SI-SDR.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
