MinUn: Accurate ML Inference on Microcontrollers

Shikhar Jaiswal; Rahul Kiran Kranti Goli; Aayan Kumar; Vivek Seshadri; and Rahul Sharma

arXiv:2210.16556·cs.LG·December 1, 2022

MinUn: Accurate ML Inference on Microcontrollers

Shikhar Jaiswal, Rahul Kiran Kranti Goli, Aayan Kumar, Vivek Seshadri, and Rahul Sharma

PDF

Open Access

TL;DR

MinUn is a novel TinyML framework that efficiently generates memory-frugal, high-precision inference code for microcontrollers, supporting emerging number representations and optimizing memory usage.

Contribution

It is the first TinyML framework to holistically address parametric number representations, tensor precision assignment, and memory fragmentation for microcontrollers.

Findings

01

Outperforms prior TinyML frameworks in efficiency

02

Supports emerging number representations like posits

03

Generates code for popular ARM microcontrollers

Abstract

Running machine learning inference on tiny devices, known as TinyML, is an emerging research area. This task requires generating inference code that uses memory frugally, a task that standard ML frameworks are ill-suited for. A deployment framework for TinyML must be a) parametric in the number representation to take advantage of the emerging representations like posits, b) carefully assign high-precision to a few tensors so that most tensors can be kept in low-precision while still maintaining model accuracy, and c) avoid memory fragmentation. We describe MinUn, the first TinyML framework that holistically addresses these issues to generate efficient code for ARM microcontrollers (e.g., Arduino Uno, Due and STM32H747) that outperforms the prior TinyML frameworks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Neural Networks and Applications · Machine Learning and Data Classification