Efficient Independent Vector Extraction of Dominant Target Speech

Lele Liao; Zhaoyi Gu; Jing Lu

arXiv:2008.00143·cs.SD·August 4, 2020

Efficient Independent Vector Extraction of Dominant Target Speech

Lele Liao, Zhaoyi Gu, Jing Lu

PDF

Open Access

TL;DR

This paper introduces a computationally efficient blind speech extraction method that isolates a dominant target speaker using a modified independent vector analysis algorithm, suitable when the target's average power exceeds interference.

Contribution

The paper presents a novel, simplified blind speech extraction technique based on a modified independent vector analysis, addressing computational complexity and scaling ambiguity.

Findings

01

Effective extraction of dominant speech demonstrated in simulations.

02

Reduced computational complexity compared to full blind source separation.

03

Successful handling of scaling ambiguity with a new one-unit scaling operation.

Abstract

The complete decomposition performed by blind source separation is computationally demanding and superfluous when only the speech of one specific target speaker is desired. In this paper, we propose a computationally efficient blind speech extraction method based on a proper modification of the commonly utilized independent vector analysis algorithm, under the mild assumption that the average power of signal of interest outweighs interfering speech sources. Considering that the minimum distortion principle cannot be implemented since the full demixing matrix is not available, we also design a one-unit scaling operation to solve the scaling ambiguity. Simulations validate the efficacy of the proposed method in extracting the dominant speech.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBlind Source Separation Techniques · Speech and Audio Processing · Advanced Adaptive Filtering Techniques