Pex: Memory-efficient Microcontroller Deep Learning through Partial Execution
Edgar Liberis, Nicholas D. Lane

TL;DR
Pex introduces a novel partial execution method for microcontroller deep learning, significantly reducing SRAM usage without sacrificing accuracy by exploiting operator properties and co-designing network architecture.
Contribution
The paper presents Pex, a compiler that automatically generates memory-efficient execution schedules for neural networks on MCUs, enabling larger models and higher resolution inputs.
Findings
Achieves up to 2.9% accuracy increase with low SRAM usage
Reduces memory by 4x with partial execution, 10.5x with co-design
Enables processing higher resolution inputs, improving accuracy
Abstract
Embedded and IoT devices, largely powered by microcontroller units (MCUs), could be made more intelligent by leveraging on-device deep learning. One of the main challenges of neural network inference on an MCU is the extremely limited amount of read-write on-chip memory (SRAM, < 512 kB). SRAM is consumed by the neural network layer (operator) input and output buffers, which, traditionally, must be in memory (materialised) for an operator to execute. We discuss a novel execution paradigm for microcontroller deep learning, which modifies the execution of neural networks to avoid materialising full buffers in memory, drastically reducing SRAM usage with no computation overhead. This is achieved by exploiting the properties of operators, which can consume/produce a fraction of their input/output at a time. We describe a partial execution compiler, Pex, which produces memory-efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
