Directional Sparse Filtering using Weighted Lehmer Mean for Blind   Separation of Unbalanced Speech Mixtures

Karn Watcharasupat; Anh H. T. Nguyen; Ching-Hui Ooi; Andy W.; H. Khong

arXiv:2102.00196·eess.AS·July 21, 2021

Directional Sparse Filtering using Weighted Lehmer Mean for Blind Separation of Unbalanced Speech Mixtures

Karn Watcharasupat, Anh H. T. Nguyen, Ching-Hui Ooi, Andy W., H. Khong

PDF

2 Repos

TL;DR

This paper introduces a novel directional sparse filtering algorithm that uses weighted Lehmer mean to effectively separate unbalanced speech sources in various acoustic environments.

Contribution

It presents a new DSF-based method with learnable weights for the Lehmer mean, addressing source imbalance in blind speech separation.

Findings

01

Improved separation performance over baseline methods

02

Effective in multiple real acoustic environments

03

Adaptive handling of source imbalance

Abstract

In blind source separation of speech signals, the inherent imbalance in the source spectrum poses a challenge for methods that rely on single-source dominance for the estimation of the mixing matrix. We propose an algorithm based on the directional sparse filtering (DSF) framework that utilizes the Lehmer mean with learnable weights to adaptively account for source imbalance. Performance evaluation in multiple real acoustic environments show improvements in source separation compared to the baseline methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDirectional Sparse FIltering