# Max-Pooling Loss Training of Long Short-Term Memory Networks for   Small-Footprint Keyword Spotting

**Authors:** Ming Sun, Anirudh Raju, George Tucker, Sankaran Panchapagesan,, Gengshen Fu, Arindam Mandal, Spyros Matsoukas, Nikko Strom, Shiv Vitaladevuni

arXiv: 1705.02411 · 2017-05-09

## TL;DR

This paper introduces a max-pooling loss function for training LSTM networks aimed at small-footprint keyword spotting, demonstrating improved performance over traditional methods with low resource requirements.

## Contribution

The study proposes a novel max-pooling loss training approach for LSTM-based keyword spotting, showing it outperforms standard cross-entropy training and baseline models.

## Key findings

- Max-pooling loss trained LSTM outperforms cross-entropy trained LSTM.
- Max-pooling loss with pre-training yields best performance.
- Achieves 67.6% relative reduction in AUC over baseline DNN.

## Abstract

We propose a max-pooling based loss function for training Long Short-Term Memory (LSTM) networks for small-footprint keyword spotting (KWS), with low CPU, memory, and latency requirements. The max-pooling loss training can be further guided by initializing with a cross-entropy loss trained network. A posterior smoothing based evaluation approach is employed to measure keyword spotting performance. Our experimental results show that LSTM models trained using cross-entropy loss or max-pooling loss outperform a cross-entropy loss trained baseline feed-forward Deep Neural Network (DNN). In addition, max-pooling loss trained LSTM with randomly initialized network performs better compared to cross-entropy loss trained LSTM. Finally, the max-pooling loss trained LSTM initialized with a cross-entropy pre-trained network shows the best performance, which yields $67.6\%$ relative reduction compared to baseline feed-forward DNN in Area Under the Curve (AUC) measure.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.02411/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1705.02411/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/1705.02411/full.md

---
Source: https://tomesphere.com/paper/1705.02411