A Cascade Architecture for Keyword Spotting on Mobile Devices

Alexander Gruenstein; Raziel Alvarez; Chris Thornton; Mohammadali; Ghodrat

arXiv:1712.03603·cs.SD·December 12, 2017·33 cites

A Cascade Architecture for Keyword Spotting on Mobile Devices

Alexander Gruenstein, Raziel Alvarez, Chris Thornton, Mohammadali, Ghodrat

PDF

Open Access

TL;DR

This paper introduces a cascade architecture for keyword spotting on mobile devices that combines low computational cost with DSP chips to enable continuous listening with minimal power use.

Contribution

The paper proposes a novel cascade architecture tailored for mobile keyword spotting that optimally balances accuracy and power efficiency.

Findings

01

Achieves low power consumption suitable for mobile devices.

02

Enables continuous listening for keywords.

03

Utilizes specialized DSP chips for efficiency.

Abstract

We present a cascade architecture for keyword spotting with speaker verification on mobile devices. By pairing a small computational footprint with specialized digital signal processing (DSP) chips, we are able to achieve low power consumption while continuously listening for a keyword.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing