TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids

Igor Fedorov; Marko Stamenovic; Carl Jensen; Li-Chia Yang; Ari; Mandell; Yiming Gan; Matthew Mattina; Paul N. Whatmough

arXiv:2005.11138·eess.AS·September 15, 2021

TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids

Igor Fedorov, Marko Stamenovic, Carl Jensen, Li-Chia Yang, Ari, Mandell, Yiming Gan, Matthew Mattina, Paul N. Whatmough

PDF

1 Repo

TL;DR

This paper presents TinyLSTMs, a highly efficient neural speech enhancement model optimized for hearing aids, using model compression techniques to significantly reduce size and computational load while maintaining audio quality.

Contribution

It introduces the first application of pruning and quantization techniques to RNN speech enhancement for hearing aids, achieving substantial model compression and efficiency gains.

Findings

01

11.9× reduction in model size

02

2.9× reduction in computational operations

03

Latency of 2.39ms, within target constraints

Abstract

Modern speech enhancement algorithms achieve remarkable noise suppression by means of large recurrent neural networks (RNNs). However, large RNNs limit practical deployment in hearing aid hardware (HW) form-factors, which are battery powered and run on resource-constrained microcontroller units (MCUs) with limited memory capacity and compute capability. In this work, we use model compression techniques to bridge this gap. We define the constraints imposed on the RNN by the HW and describe a method to satisfy them. Although model compression techniques are an active area of research, we are the first to demonstrate their efficacy for RNN speech enhancement, using pruning and integer quantization of weights/activations. We also demonstrate state update skipping, which reduces the computational load. Finally, we conduct a perceptual evaluation of the compressed models to verify audio…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BoseCorp/efficient-neural-speech-enhancement
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPruning