# Keyword Spotting for Hearing Assistive Devices Robust to External   Speakers

**Authors:** Iv\'an L\'opez-Espejo, Zheng-Hua Tan, Jesper Jensen

arXiv: 1906.09417 · 2019-06-27

## TL;DR

This paper presents a multi-task deep residual network for keyword spotting in hearing assistive devices that is robust to external speakers, significantly improving accuracy by 32% over traditional systems.

## Contribution

It introduces a multi-task learning approach that jointly performs keyword spotting and external speaker detection for hearing aids, enhancing robustness against external speakers.

## Key findings

- Achieves around 32% relative improvement in keyword spotting accuracy.
- Successfully extends deep residual networks with minimal parameter increase.
- Demonstrates effectiveness using a Google Speech Commands-based corpus.

## Abstract

Keyword spotting (KWS) is experiencing an upswing due to the pervasiveness of small electronic devices that allow interaction with them via speech. Often, KWS systems are speaker-independent, which means that any person --user or not-- might trigger them. For applications like KWS for hearing assistive devices this is unacceptable, as only the user must be allowed to handle them. In this paper we propose KWS for hearing assistive devices that is robust to external speakers. A state-of-the-art deep residual network for small-footprint KWS is regarded as a basis to build upon. By following a multi-task learning scheme, this system is extended to jointly perform KWS and users' own-voice/external speaker detection with a negligible increase in the number of parameters. For experiments, we generate from the Google Speech Commands Dataset a speech corpus emulating hearing aids as a capturing device. Our results show that this multi-task deep residual network is able to achieve a KWS accuracy relative improvement of around 32% with respect to a system that does not deal with external speakers.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.09417/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1906.09417/full.md

## References

16 references — full list in the complete paper: https://tomesphere.com/paper/1906.09417/full.md

---
Source: https://tomesphere.com/paper/1906.09417