MCUNet: Tiny Deep Learning on IoT Devices
Ji Lin, Wei-Ming Chen, Yujun Lin, John Cohn, Chuang Gan, Song Han

TL;DR
MCUNet enables efficient ImageNet-scale inference on microcontrollers by jointly designing neural architectures and inference engines, achieving high accuracy with significantly reduced memory and computation requirements.
Contribution
The paper introduces MCUNet, a novel framework combining TinyNAS and TinyEngine for resource-constrained microcontrollers, enabling large-scale image recognition.
Findings
Achieves >70% ImageNet top-1 accuracy on microcontrollers.
Reduces memory usage by 4.8x and accelerates inference by up to 3.3x.
Outperforms existing solutions in accuracy and efficiency on wake word tasks.
Abstract
Machine learning on tiny IoT devices based on microcontroller units (MCU) is appealing but challenging: the memory of microcontrollers is 2-3 orders of magnitude smaller even than mobile phones. We propose MCUNet, a framework that jointly designs the efficient neural architecture (TinyNAS) and the lightweight inference engine (TinyEngine), enabling ImageNet-scale inference on microcontrollers. TinyNAS adopts a two-stage neural architecture search approach that first optimizes the search space to fit the resource constraints, then specializes the network architecture in the optimized search space. TinyNAS can automatically handle diverse constraints (i.e.device, latency, energy, memory) under low search costs.TinyNAS is co-designed with TinyEngine, a memory-efficient inference library to expand the search space and fit a larger model. TinyEngine adapts the memory scheduling according to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Machine Learning and Data Classification
MethodsPointwise Convolution · Depthwise Convolution · Batch Normalization · Depthwise Separable Convolution · Inverted Residual Block · 1x1 Convolution · Average Pooling · Convolution · Tether Customer Service Number +1-833-534-1729
