Similarity-and-Independence-Aware Beamformer: Method for Target Source   Extraction using Magnitude Spectrogram as Reference

Atsuo Hiroe

arXiv:2006.00772·eess.AS·September 22, 2023

Similarity-and-Independence-Aware Beamformer: Method for Target Source Extraction using Magnitude Spectrogram as Reference

Atsuo Hiroe

PDF

Open Access

TL;DR

This paper introduces the similarity-and-independence-aware beamformer (SIBF), a novel source extraction method that uses a magnitude spectrogram as a reference to improve accuracy over neural network-based methods.

Contribution

The paper extends deflationary independent component analysis by incorporating similarity and independence considerations, enabling more precise target source extraction.

Findings

01

SIBF outperforms DNN-based spectrograms in accuracy

02

Experimental validation on CHiME3 dataset

03

Introduces two source models reflecting similarity

Abstract

This study presents a novel method for source extraction, referred to as the similarity-and-independence-aware beamformer (SIBF). The SIBF extracts the target signal using a rough magnitude spectrogram as the reference signal. The advantage of the SIBF is that it can obtain an accurate target signal, compared to the spectrogram generated by target-enhancing methods such as the speech enhancement based on deep neural networks (DNNs). For the extraction, we extend the framework of the deflationary independent component analysis, by considering the similarity between the reference and extracted target, as well as the mutual independence of all potential sources. To solve the extraction problem by maximum-likelihood estimation, we introduce two source model types that can reflect the similarity. The experimental results from the CHiME3 dataset show that the target signal extracted by the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Blind Source Separation Techniques · Advanced Adaptive Filtering Techniques