PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition
Boris Bergsma, Minhao Yang, Milos Cernak

TL;DR
This paper introduces power-efficient analog acoustic features (PEAF) validated by CMOS chips, enabling more power-efficient audio recognition in wearable devices by leveraging analog processing and a new information-theoretic analysis.
Contribution
The paper presents novel learnable analog acoustic features and a theoretical framework for analyzing information flow, improving power efficiency and accuracy in audio recognition.
Findings
Higher power efficiency compared to digital features
Achieved up to 7% accuracy improvement in keyword spotting
Validated by fabricated CMOS chips for real-world application
Abstract
At the end of Moore's law, new computing paradigms are required to prolong the battery life of wearable and IoT smart audio devices. Theoretical analysis and physical validation have shown that analog signal processing (ASP) can be more power-efficient than its digital counterpart in the realm of low-to-medium signal-to-noise ratio applications. In addition, ASP allows a direct interface with an analog microphone without a power-hungry analog-to-digital converter. Here, we present power-efficient analog acoustic features (PEAF) that are validated by fabricated CMOS chips for running audio recognition. Linear, non-linear, and learnable PEAF variants are evaluated on two speech processing tasks that are demanded in many battery-operated devices: wake word detection (WWD) and keyword spotting (KWS). Compared to digital acoustic features, higher power efficiency with competitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music and Audio Processing · Blind Source Separation Techniques
