Pex: Memory-efficient Microcontroller Deep Learning through Partial   Execution

Edgar Liberis; Nicholas D. Lane

arXiv:2211.17246·cs.LG·January 25, 2023·1 cites

Pex: Memory-efficient Microcontroller Deep Learning through Partial Execution

Edgar Liberis, Nicholas D. Lane

PDF

Open Access

TL;DR

Pex introduces a novel partial execution method for microcontroller deep learning, significantly reducing SRAM usage without sacrificing accuracy by exploiting operator properties and co-designing network architecture.

Contribution

The paper presents Pex, a compiler that automatically generates memory-efficient execution schedules for neural networks on MCUs, enabling larger models and higher resolution inputs.

Findings

01

Achieves up to 2.9% accuracy increase with low SRAM usage

02

Reduces memory by 4x with partial execution, 10.5x with co-design

03

Enables processing higher resolution inputs, improving accuracy

Abstract

Embedded and IoT devices, largely powered by microcontroller units (MCUs), could be made more intelligent by leveraging on-device deep learning. One of the main challenges of neural network inference on an MCU is the extremely limited amount of read-write on-chip memory (SRAM, < 512 kB). SRAM is consumed by the neural network layer (operator) input and output buffers, which, traditionally, must be in memory (materialised) for an operator to execute. We discuss a novel execution paradigm for microcontroller deep learning, which modifies the execution of neural networks to avoid materialising full buffers in memory, drastically reducing SRAM usage with no computation overhead. This is achieved by exploiting the properties of operators, which can consume/produce a fraction of their input/output at a time. We describe a partial execution compiler, Pex, which produces memory-efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications