Raking the Cocktail Party
Ivan Dokmani\'c, Robin Scheibler, Martin Vetterli

TL;DR
This paper introduces the acoustic rake receiver, a novel microphone beamformer that leverages echoes as additional spatial information to enhance noise and interference suppression in speech signals.
Contribution
It proposes a new spatial processing approach that uses echoes to improve beamforming, with theoretical and numerical validation showing significant performance gains.
Findings
Significant noise and interference suppression improvements.
Enhanced speech quality as measured by PESQ.
Theoretical and numerical validation of the proposed methods.
Abstract
We present the concept of an acoustic rake receiver---a microphone beamformer that uses echoes to improve the noise and interference suppression. The rake idea is well-known in wireless communications; it involves constructively combining different multipath components that arrive at the receiver antennas. Unlike spread-spectrum signals used in wireless communications, speech signals are not orthogonal to their shifts. Therefore, we focus on the spatial structure, rather than temporal. Instead of explicitly estimating the channel, we create correspondences between early echoes in time and image sources in space. These multiple sources of the desired and the interfering signal offer additional spatial diversity that we can exploit in the beamformer design. We present several "intuitive" and optimal formulations of acoustic rake receivers, and show theoretically and numerically that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
