MCUNet: Tiny Deep Learning on IoT Devices

Ji Lin; Wei-Ming Chen; Yujun Lin; John Cohn; Chuang Gan; Song Han

arXiv:2007.10319·cs.CV·November 20, 2020·257 cites

MCUNet: Tiny Deep Learning on IoT Devices

Ji Lin, Wei-Ming Chen, Yujun Lin, John Cohn, Chuang Gan, Song Han

PDF

Open Access 1 Repo 1 Video

TL;DR

MCUNet enables efficient ImageNet-scale inference on microcontrollers by jointly designing neural architectures and inference engines, achieving high accuracy with significantly reduced memory and computation requirements.

Contribution

The paper introduces MCUNet, a novel framework combining TinyNAS and TinyEngine for resource-constrained microcontrollers, enabling large-scale image recognition.

Findings

01

Achieves >70% ImageNet top-1 accuracy on microcontrollers.

02

Reduces memory usage by 4.8x and accelerates inference by up to 3.3x.

03

Outperforms existing solutions in accuracy and efficiency on wake word tasks.

Abstract

Machine learning on tiny IoT devices based on microcontroller units (MCU) is appealing but challenging: the memory of microcontrollers is 2-3 orders of magnitude smaller even than mobile phones. We propose MCUNet, a framework that jointly designs the efficient neural architecture (TinyNAS) and the lightweight inference engine (TinyEngine), enabling ImageNet-scale inference on microcontrollers. TinyNAS adopts a two-stage neural architecture search approach that first optimizes the search space to fit the resource constraints, then specializes the network architecture in the optimized search space. TinyNAS can automatically handle diverse constraints (i.e.device, latency, energy, memory) under low search costs.TinyNAS is co-designed with TinyEngine, a memory-efficient inference library to expand the search space and fit a larger model. TinyEngine adapts the memory scheduling according to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mit-han-lab/mcunet
tf

Videos

MCUNet: Tiny Deep Learning on IoT Devices· slideslive

Taxonomy

TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Machine Learning and Data Classification

MethodsPointwise Convolution · Depthwise Convolution · Batch Normalization · Depthwise Separable Convolution · Inverted Residual Block · 1x1 Convolution · Average Pooling · Convolution · Tether Customer Service Number +1-833-534-1729