SpArSe: Sparse Architecture Search for CNNs on Resource-Constrained Microcontrollers
Igor Fedorov, Ryan P. Adams, Matthew Mattina, Paul N. Whatmough

TL;DR
This paper introduces SpArSe, a unified neural architecture search and pruning method that designs small, accurate CNNs suitable for deployment on resource-limited microcontrollers, advancing IoT applications.
Contribution
It presents a novel combined neural architecture search and pruning approach that produces CNNs optimized for microcontroller constraints, outperforming previous methods in size and accuracy.
Findings
CNNs designed are up to 4.35 times smaller than previous models.
The method achieves higher accuracy on IoT datasets.
Designed CNNs meet strict MCU memory constraints.
Abstract
The vast majority of processors in the world are actually microcontroller units (MCUs), which find widespread use performing simple control tasks in applications ranging from automobiles to medical devices and office equipment. The Internet of Things (IoT) promises to inject machine learning into many of these every-day objects via tiny, cheap MCUs. However, these resource-impoverished hardware platforms severely limit the complexity of machine learning models that can be deployed. For example, although convolutional neural networks (CNNs) achieve state-of-the-art results on many visual recognition tasks, CNN inference on MCUs is challenging due to severe finite memory limitations. To circumvent the memory challenge associated with CNNs, various alternatives have been proposed that do fit within the memory budget of an MCU, albeit at the cost of prediction accuracy. This paper…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications
MethodsPruning · Sigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory
