A Multi-stage Low-latency Enhancement System for Hearing Aids

Chengwei Ouyang; Kexin Fei; Haoshuai Zhou; Congxi Lu; Linkai Li

arXiv:2508.04283 · eess.AS · August 7, 2025

TL;DR

This paper presents a low-latency, multi-stage speech enhancement system for hearing aids, built for the ICASSP 2023 Clarity Challenge. It combines magnitude- and complex-domain stages that exploit phase information, an asymmetric window pair, and head rotation data to improve enhancement within the challenge's 5 ms latency constraint.

Contribution

It introduces a multi-stage enhancement framework that operates in both the magnitude and complex domains, uses an asymmetric window pair, integrates head rotation data with the mixture signals, and adds a post-processing module aimed at higher HASPI scores.

Findings

01

The post-processing module achieves higher HASPI scores when used with the baseline system's hearing aid amplification stage.

02

The multi-stage design, spanning the magnitude and complex domains, makes better use of phase information.

03

The asymmetric window pair gives higher frequency resolution while staying within the 5 ms latency constraint.

Abstract

This paper proposes an end-to-end system for the ICASSP 2023 Clarity Challenge. In this work, we introduce four major novelties: (1) a novel multi-stage system in both the magnitude and complex domains to better utilize phase information; (2) an asymmetric window pair to achieve higher frequency resolution within the 5 ms latency constraint; (3) the integration of head rotation information with the mixture signals to achieve better enhancement; (4) a post-processing module that achieves higher hearing aid speech perception index (HASPI) scores with the hearing aid amplification stage provided by the baseline system.
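
To illustrate item (2), the sketch below builds one common kind of asymmetric analysis/synthesis window pair and checks that it reconstructs a signal under overlap-add. It is not the authors' design: the abstract does not specify the windows, so the 16 kHz sample rate, the 20 ms analysis window, the hop size, and the function names `asymmetric_window_pair` and `analysis_synthesis` are all assumptions made for illustration. The idea is that a long analysis window sets the frequency resolution, while a short synthesis window limits how far each output sample depends on future input.

```python
import numpy as np

FS = 16000   # assumed sample rate in Hz (not stated in the abstract)
K = 320      # assumed analysis window length: 20 ms at 16 kHz
M = 40       # assumed hop size; synthesis support is 2*M = 80 samples = 5 ms


def asymmetric_window_pair(K, M):
    """Return (analysis, synthesis) windows of length K for hop size M.

    Their product is a periodic Hann window on the last 2*M samples,
    which sums to one under overlap-add with hop M.
    """
    if K <= 2 * M:
        raise ValueError("K must exceed 2*M for an asymmetric pair")
    rise = np.sin(np.pi * np.arange(K - M) / (2 * (K - M)))  # long fade-in
    fall = np.cos(np.pi * np.arange(M) / (2 * M))            # short fade-out
    analysis = np.concatenate([rise, fall])
    # Target product window: periodic Hann of length 2*M
    hann = 0.5 * (1.0 - np.cos(np.pi * np.arange(2 * M) / M))
    synthesis = np.zeros(K)
    synthesis[K - 2 * M:] = hann / analysis[K - 2 * M:]
    return analysis, synthesis


def analysis_synthesis(x, K, M):
    """STFT analysis and overlap-add synthesis with no spectral change.

    Returns the output and the slice of samples covered by two frames,
    where reconstruction is expected to be exact.
    """
    analysis, synthesis = asymmetric_window_pair(K, M)
    y = np.zeros(len(x))
    n_frames = (len(x) - K) // M + 1
    for i in range(n_frames):
        start = i * M
        spectrum = np.fft.rfft(x[start:start + K] * analysis)
        # A real enhancer would modify `spectrum` here.
        frame = np.fft.irfft(spectrum, n=K)
        y[start:start + K] += frame * synthesis
    valid = slice(K - M, (n_frames - 1) * M + K - M)
    return y, valid


rng = np.random.default_rng(0)
x = rng.standard_normal(2000)
y, valid = analysis_synthesis(x, K, M)
max_error = np.max(np.abs(y[valid] - x[valid]))
```

Under these assumed values, the FFT bin spacing is FS / K = 50 Hz, whereas a symmetric window confined to the same 5 ms would give FS / (2 * M) = 200 Hz. The synthesis window is zero outside its last 2 * M samples, so each output sample is complete once at most 5 ms of further input has arrived.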
