Two-step Band-split Neural Network Approach for Full-band Residual Echo   Suppression

Zihan Zhang; Shimin Zhang; Mingshuai Liu; Yanhong Leng; Zhe Han; Li; Chen; Lei Xie

arXiv:2303.06828·eess.AS·March 14, 2023·ICASSP·1 cites

Two-step Band-split Neural Network Approach for Full-band Residual Echo Suppression

Zihan Zhang, Shimin Zhang, Mingshuai Liu, Yanhong Leng, Zhe Han, Li, Chen, Lei Xie

PDF

Open Access

TL;DR

This paper introduces a two-step neural network method for full-band residual echo suppression, splitting signals into wide and high bands for targeted processing, achieving high MOS scores and ranking second in a challenge.

Contribution

The novel two-step band-split neural network approach effectively handles full-band residual echo suppression with improved accuracy and reduced complexity.

Findings

01

Achieved MOS of 4.344 on ICASSP 2023 AEC Challenge

02

Ranked 2nd (tied) in the non-personalized track

03

Effective separation of wide-band and high-band signals

Abstract

This paper describes a Two-step Band-split Neural Network (TBNN) approach for full-band acoustic echo cancellation. Specifically, after linear filtering, we split the full-band signal into wide-band (16KHz) and high-band (16-48KHz) for residual echo removal with lower modeling difficulty. The wide-band signal is processed by an updated gated convolutional recurrent network (GCRN) with U $^{2}$ encoder while the high-band signal is processed by a high-band post-filter net with lower complexity. Our approach submitted to ICASSP 2023 AEC Challenge has achieved an overall mean opinion score (MOS) of 4.344 and a word accuracy (WAcc) ratio of 0.795, leading to the 2 $^{n d}$ (tied) in the ranking of the non-personalized track.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Advanced Adaptive Filtering Techniques