Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks
Jaewook Lee, Yoel Park, Seulki Lee

TL;DR
This paper presents a novel CNN design that drastically reduces memory usage to 63 KB, enabling on-device vision tasks on low-end embedded devices while maintaining competitive accuracy.
Contribution
It introduces three design principles—input segmentation, patch tunneling, and bottleneck reordering—that significantly lower CNN memory requirements.
Findings
Achieves 61.58% top-1 accuracy on ImageNet with 63 KB memory.
Memory usage is up to 89x smaller than MobileNet.
Outperforms existing memory-efficient networks in resource-constrained environments.
Abstract
In this paper, we introduce a memory-efficient CNN (convolutional neural network), which enables resource-constrained low-end embedded and IoT devices to perform on-device vision tasks, such as image classification and object detection, using extremely low memory, i.e., only 63 KB on ImageNet classification. Based on the bottleneck block of MobileNet, we propose three design principles that significantly curtail the peak memory usage of a CNN so that it can fit the limited KB memory of the low-end device. First, 'input segmentation' divides an input image into a set of patches, including the central patch overlapped with the others, reducing the size (and memory requirement) of a large input image. Second, 'patch tunneling' builds independent tunnel-like paths consisting of multiple bottleneck blocks per patch, penetrating through the entire model from an input patch to the last layer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · CCD and CMOS Imaging Sensors
MethodsSparse Evolutionary Training · Convolution
