Determined Multichannel Blind Source Separation with Clustered Source   Model

Jianyu Wang; Shanzheng Guan

arXiv:2405.03118·cs.SD·May 7, 2024

Determined Multichannel Blind Source Separation with Clustered Source Model

Jianyu Wang, Shanzheng Guan

PDF

Open Access

TL;DR

This paper introduces a novel multichannel blind source separation method using a clustered source model based on nonnegative block-term decomposition, improving source independence modeling and outperforming existing ILRMA techniques.

Contribution

It proposes a new clustered source model with NBTD that captures inter-channel dependencies and allows for orthogonality constraints, enhancing separation performance.

Findings

01

Outperforms ILRMA in anechoic conditions

02

Surpasses ILRMA in reverberant environments

03

Provides interpretable latent source representations

Abstract

The independent low-rank matrix analysis (ILRMA) method stands out as a prominent technique for multichannel blind audio source separation. It leverages nonnegative matrix factorization (NMF) and nonnegative canonical polyadic decomposition (NCPD) to model source parameters. While it effectively captures the low-rank structure of sources, the NMF model overlooks inter-channel dependencies. On the other hand, NCPD preserves intrinsic structure but lacks interpretable latent factors, making it challenging to incorporate prior information as constraints. To address these limitations, we introduce a clustered source model based on nonnegative block-term decomposition (NBTD). This model defines blocks as outer products of vectors (clusters) and matrices (for spectral structure modeling), offering interpretable latent vectors. Moreover, it enables straightforward integration of orthogonality…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBlind Source Separation Techniques · Speech and Audio Processing · Advanced Algorithms and Applications