Resource-efficient DNNs for Keyword Spotting using Neural Architecture   Search and Quantization

David Peter; Wolfgang Roth; Franz Pernkopf

arXiv:2012.10138·eess.AS·December 21, 2020

Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and Quantization

David Peter, Wolfgang Roth, Franz Pernkopf

PDF

2 Repos

TL;DR

This paper presents a neural architecture search and quantization approach to develop highly efficient convolutional neural networks for keyword spotting, achieving high accuracy with minimal resource consumption.

Contribution

The paper introduces a differentiable NAS method combined with weight quantization to automatically design small, accurate CNN models for keyword spotting in resource-constrained environments.

Findings

01

Achieved 95.4% accuracy with 494.8 kB memory and 19.6 million operations.

02

Weight quantization to low bit-widths maintains accuracy with reduced memory.

03

Increasing input features improves accuracy to 96.3% with lower resource usage.

Abstract

This paper introduces neural architecture search (NAS) for the automatic discovery of small models for keyword spotting (KWS) in limited resource environments. We employ a differentiable NAS approach to optimize the structure of convolutional neural networks (CNNs) to maximize the classification accuracy while minimizing the number of operations per inference. Using NAS only, we were able to obtain a highly efficient model with 95.4% accuracy on the Google speech commands dataset with 494.8 kB of memory usage and 19.6 million operations. Additionally, weight quantization is used to reduce the memory consumption even further. We show that weight quantization to low bit-widths (e.g. 1 bit) can be used without substantial loss in accuracy. By increasing the number of input features from 10 MFCC to 20 MFCC we were able to increase the accuracy to 96.3% at 340.1 kB of memory usage and 27.1…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDifferentiable Neural Architecture Search