Sound Event Detection with Binary Neural Networks on Tightly Power-Constrained IoT Devices
Gianmarco Cerutti, Renzo Andri, Lukas Cavigelli, Michele Magno,, Elisabetta Farella, Luca Benini

TL;DR
This paper presents a binary neural network for sound event detection on ultra-low power IoT devices, achieving high accuracy with significantly reduced memory and energy consumption.
Contribution
It demonstrates how to adapt and retrain a CNN for sound detection using binary neural networks on a RISC-V microcontroller, balancing accuracy and resource constraints.
Findings
Achieves 77.9% accuracy with 58 kB memory footprint.
Outperforms ARM Cortex-M4 in speed and energy efficiency.
Reaches 4.6 GMAC/s throughput with high energy efficiency.
Abstract
Sound event detection (SED) is a hot topic in consumer and smart city applications. Existing approaches based on Deep Neural Networks are very effective, but highly demanding in terms of memory, power, and throughput when targeting ultra-low power always-on devices. Latency, availability, cost, and privacy requirements are pushing recent IoT systems to process the data on the node, close to the sensor, with a very limited energy supply, and tight constraints on the memory size and processing capabilities precluding to run state-of-the-art DNNs. In this paper, we explore the combination of extreme quantization to a small-footprint binary neural network (BNN) with the highly energy-efficient, RISC-V-based (8+1)-core GAP8 microcontroller. Starting from an existing CNN for SED whose footprint (815 kB) exceeds the 512 kB of memory available on our platform, we retrain the network using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
