Similarity-and-Independence-Aware Beamformer with Iterative Casting and Boost Start for Target Source Extraction Using Reference
Atsuo Hiroe

TL;DR
This paper introduces the similarity-and-independence-aware beamformer (SIBF), which enhances target speech extraction by integrating similarity measures with ICA, and employs iterative casting and boost start for improved accuracy and convergence.
Contribution
The study presents a novel SIBF framework that incorporates similarity considerations into ICA, along with new methods for faster convergence and more accurate reference generation.
Findings
Improved extraction performance with boost start and iterative casting.
Development of a fixed point concept for accuracy assessment.
Unified formulation of SIBF and traditional mask-based beamformers.
Abstract
Target source extraction is significant for improving human speech intelligibility and the speech recognition performance of computers. This study describes a method for target source extraction, called the similarity-and-independence-aware beamformer (SIBF). The SIBF extracts the target source using a rough magnitude spectrogram as the reference signal. The advantage of the SIBF is that it can obtain a more accurate signal than the spectrogram generated by target-enhancing methods such as speech enhancement based on deep neural networks. For the extraction, we extend the framework of deflationary independent component analysis (ICA) by considering the similarities between the reference and extracted target sources, in addition to the mutual independence of all the potential sources. To solve the extraction problem by maximum-likelihood estimation, we introduce three source models that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsIndependent Component Analysis
