PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition

Boris Bergsma; Minhao Yang; Milos Cernak

arXiv:2110.03715 · eess.AS · March 30, 2022

TL;DR

This paper introduces power-efficient analog acoustic features (PEAF), validated by fabricated CMOS chips, for audio recognition on wearable devices. It combines analog signal processing with a new information-theoretic analysis of the feature extraction pipeline.

Contribution

The paper presents novel learnable analog acoustic features and a theoretical framework for analyzing information flow, improving power efficiency and accuracy in audio recognition.

Findings

01. Higher power efficiency than digital acoustic features

02. Up to 7% accuracy improvement in keyword spotting

03. Validated by fabricated CMOS chips

Abstract

At the end of Moore's law, new computing paradigms are required to prolong the battery life of wearable and IoT smart audio devices. Theoretical analysis and physical validation have shown that analog signal processing (ASP) can be more power-efficient than its digital counterpart in the realm of low-to-medium signal-to-noise ratio applications. In addition, ASP allows a direct interface with an analog microphone without a power-hungry analog-to-digital converter. Here, we present power-efficient analog acoustic features (PEAF) that are validated by fabricated CMOS chips for running audio recognition. Linear, non-linear, and learnable PEAF variants are evaluated on two speech processing tasks that are demanded in many battery-operated devices: wake word detection (WWD) and keyword spotting (KWS). Compared to digital acoustic features, higher power efficiency with competitive…

Taxonomy

Topics: Speech and Audio Processing · Music and Audio Processing · Blind Source Separation Techniques