TL;DR
This paper presents a neural architecture search and quantization approach to develop highly efficient convolutional neural networks for keyword spotting, achieving high accuracy with minimal resource consumption.
Contribution
The paper introduces a differentiable NAS method combined with weight quantization to automatically design small, accurate CNN models for keyword spotting in resource-constrained environments.
Findings
Achieved 95.4% accuracy with 494.8 kB memory and 19.6 million operations.
Weight quantization to low bit-widths maintains accuracy with reduced memory.
Increasing input features improves accuracy to 96.3% with lower resource usage.
Abstract
This paper introduces neural architecture search (NAS) for the automatic discovery of small models for keyword spotting (KWS) in limited resource environments. We employ a differentiable NAS approach to optimize the structure of convolutional neural networks (CNNs) to maximize the classification accuracy while minimizing the number of operations per inference. Using NAS only, we were able to obtain a highly efficient model with 95.4% accuracy on the Google speech commands dataset with 494.8 kB of memory usage and 19.6 million operations. Additionally, weight quantization is used to reduce the memory consumption even further. We show that weight quantization to low bit-widths (e.g. 1 bit) can be used without substantial loss in accuracy. By increasing the number of input features from 10 MFCC to 20 MFCC we were able to increase the accuracy to 96.3% at 340.1 kB of memory usage and 27.1…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDifferentiable Neural Architecture Search
