Spatial Loss for Unsupervised Multi-channel Source Separation
Kohei Saijo, Robin Scheibler

TL;DR
This paper introduces a spatial loss for unsupervised multi-channel source separation that leverages the duality of DOA and beamforming, enabling effective training of neural separators without labeled data.
Contribution
It proposes a novel spatial loss based on DOA-beamforming duality, improving unsupervised training of neural source separation models.
Findings
Spatial loss enhances IVA-based separator training.
Combining spatial and signal losses benefits MVDR beamformer training.
Achieves near state-of-the-art results on LibriCSS without labeled data.
Abstract
We propose a spatial loss for unsupervised multi-channel source separation. The proposed loss exploits the duality of direction of arrival (DOA) and beamforming: the steering and beamforming vectors should be aligned for the target source, but orthogonal for interfering ones. The spatial loss encourages consistency between the mixing and demixing systems from a classic DOA estimator and a neural separator, respectively. With the proposed loss, we train the neural separators based on minimum variance distortionless response (MVDR) beamforming and independent vector analysis (IVA). We also investigate the effectiveness of combining our spatial loss and a signal loss, which uses the outputs of blind source separation as the reference. We evaluate our proposed method on synthetic and recorded (LibriCSS) mixtures. We find that the spatial loss is most effective to train IVA-based separators.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Blind Source Separation Techniques · Animal Vocal Communication and Behavior
