A Small-footprint Acoustic Echo Cancellation Solution for Mobile Full-Duplex Speech Interactions
Yiheng Jiang, Tian Biao

TL;DR
This paper introduces a neural network-based, small-footprint Acoustic Echo Cancellation solution optimized for mobile devices, improving speech quality and downstream speech processing tasks in full-duplex interactions.
Contribution
It presents a robust, low-latency AEC model with data augmentation, progressive learning, and tailored post-processing for enhanced mobile speech interaction performance.
Findings
Effective echo suppression demonstrated in empirical tests.
Significant improvements in VAD and ASR accuracy.
Model suitable for real-time mobile deployment.
Abstract
In full-duplex speech interaction systems, effective Acoustic Echo Cancellation (AEC) is crucial for recovering echo-contaminated speech. This paper presents a neural network-based AEC solution to address challenges in mobile scenarios with varying hardware, nonlinear distortions and long latency. We first incorporate diverse data augmentation strategies to enhance the model's robustness across various environments. Moreover, progressive learning is employed to incrementally improve AEC effectiveness, resulting in a considerable improvement in speech quality. To further optimize AEC's downstream applications, we introduce a novel post-processing strategy employing tailored parameters designed specifically for tasks such as Voice Activity Detection (VAD) and Automatic Speech Recognition (ASR), thus enhancing their overall efficacy. Finally, our method employs a small-footprint model with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Hearing Loss and Rehabilitation
