Semi-blind source separation using convolutive transfer function for   nonlinear acoustic echo cancellation

Guoliang Cheng; Lele Liao; Kai Chen; Yuxiang Hu; Changbao Zhu; and; Jing Lu

arXiv:2207.01556·eess.AS·January 18, 2023

Semi-blind source separation using convolutive transfer function for nonlinear acoustic echo cancellation

Guoliang Cheng, Lele Liao, Kai Chen, Yuxiang Hu, Changbao Zhu, and, Jing Lu

PDF

Open Access 1 Repo

TL;DR

This paper introduces improved semi-blind source separation methods for nonlinear acoustic echo cancellation using convolutive transfer function approximation, achieving better performance in reverberant environments.

Contribution

It proposes two new SBSS methods based on AuxIVA and ILRMA with CTF approximation, enhancing real-time applicability and echo cancellation effectiveness.

Findings

01

Significantly improved echo cancellation performance.

02

Effective modeling of long impulse responses with short latency.

03

Better stability and convergence in reverberant environments.

Abstract

The recently proposed semi-blind source separation (SBSS) method for nonlinear acoustic echo cancellation (NAEC) outperforms adaptive NAEC in attenuating the nonlinear acoustic echo. However, the multiplicative transfer function (MTF) approximation makes it unsuitable for real-time applications especially in highly reverberant environments, and the natural gradient makes it hard to balance well between fast convergence speed and stability. In this paper, we propose two more effective SBSS methods based on auxiliary-function-based independent vector analysis (AuxIVA) and independent low-rank matrix analysis (ILRMA). The convolutive transfer function (CTF) approximation is used instead of MTF so that a long impulse response can be modeled with a short latency. The optimization schemes used in AuxIVA and ILRMA are carefully regularized according to the constrained demixing matrix of NAEC.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chengguoliang0/audio-samples2
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBlind Source Separation Techniques · Speech and Audio Processing · Advanced Adaptive Filtering Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings