Quantization and Deployment of Deep Neural Networks on Microcontrollers
Pierre-Emmanuel Novac (1), Ghouthi Boukli Hacene (2, 3), Alain, Pegatoquet (1), Beno\^it Miramond (1), Vincent Gripon (2) ((1) Universit\'e, C\^ote d'Azur, CNRS, LEAT, Sophia Antipolis, France, (2) IMT Atlantique,, Brest, France, (3) MILA, Montreal, Canada)

TL;DR
This paper introduces MicroAI, a flexible framework for quantizing and deploying deep neural networks on low-power microcontrollers, optimizing for power, memory, and ease of deployment across various use cases.
Contribution
The paper presents MicroAI, a novel end-to-end framework for training, quantizing, and deploying neural networks on microcontrollers, supporting multiple quantization schemes and outperforming existing inference engines.
Findings
MicroAI achieves comparable accuracy with existing methods.
It reduces memory usage and power consumption.
Supports multiple datasets and microcontroller platforms.
Abstract
Embedding Artificial Intelligence onto low-power devices is a challenging task that has been partly overcome with recent advances in machine learning and hardware design. Presently, deep neural networks can be deployed on embedded targets to perform different tasks such as speech recognition,object detection or Human Activity Recognition. However, there is still room for optimization of deep neural networks onto embedded devices. These optimizations mainly address power consumption,memory and real-time constraints, but also an easier deployment at the edge. Moreover, there is still a need for a better understanding of what can be achieved for different use cases. This work focuses on quantization and deployment of deep neural networks onto low-power 32-bit microcontrollers. The quantization methods, relevant in the context of an embedded execution onto a microcontroller, are first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
