Spatial Source Subtraction Based on Incomplete Measurements of Relative Transfer Function
Zbynek Koldovsky, Jiri Malek, Sharon Gannot

TL;DR
This paper introduces a novel compressed sensing-based method for estimating incomplete relative transfer functions from noisy microphone recordings, improving accuracy over traditional estimators in reverberant environments.
Contribution
It proposes a three-step approach combining conventional RTF estimation, frequency selection, and sparse reconstruction via a weighted $$ convex program, enhancing RTF estimation accuracy.
Findings
Improved RTF estimates in noisy, reverberant conditions.
Enhanced performance over traditional estimators.
Validated with extensive real-world recordings.
Abstract
Relative impulse responses between microphones are usually long and dense due to the reverberant acoustic environment. Estimating them from short and noisy recordings poses a long-standing challenge of audio signal processing. In this paper we apply a novel strategy based on ideas of Compressed Sensing. Relative transfer function (RTF) corresponding to the relative impulse response can often be estimated accurately from noisy data but only for certain frequencies. This means that often only an incomplete measurement of the RTF is available. A complete RTF estimate can be obtained through finding its sparsest representation in the time-domain: that is, through computing the sparsest among the corresponding relative impulse responses. Based on this approach, we propose to estimate the RTF from noisy data in three steps. First, the RTF is estimated using any conventional method such as the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
