Accelerating TinyML Inference on Microcontrollers through Approximate Kernels
Giorgos Armeniakos, Georgios Mentzos, Dimitrios Soudris

TL;DR
This paper presents a novel approximation framework combining approximate computing and software kernel design to accelerate CNN inference on microcontrollers, achieving significant latency reductions without accuracy loss.
Contribution
It introduces a kernel-based approximation method with significance-based computation skipping, optimizing TinyML inference on MCUs.
Findings
21% average latency reduction without accuracy loss
Effective approximation strategy for CNN inference on microcontrollers
Pareto optimal solutions balancing speed and accuracy
Abstract
The rapid growth of microcontroller-based IoT devices has opened up numerous applications, from smart manufacturing to personalized healthcare. Despite the widespread adoption of energy-efficient microcontroller units (MCUs) in the Tiny Machine Learning (TinyML) domain, they still face significant limitations in terms of performance and memory (RAM, Flash). In this work, we combine approximate computing and software kernel design to accelerate the inference of approximate CNN models on MCUs. Our kernel-based approximation framework firstly unpacks the operands of each convolution layer and then conducts an offline calculation to determine the significance of each operand. Subsequently, through a design space exploration, it employs a computation skipping approximation strategy based on the calculated significance. Our evaluation on an STM32-Nucleo board and 2 popular CNNs trained on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Human Pose and Action Recognition · Video Analysis and Summarization
MethodsConvolution
