RNC: Efficient RRAM-aware NAS and Compilation for DNNs on Resource-Constrained Edge Devices
Kam Chi Loong, Shihao Han, Sishuo Liu, Ning Lin, Zhongrui Wang

TL;DR
This paper introduces an RRAM-aware neural architecture search and compilation framework tailored for resource-constrained edge devices, significantly improving neural network efficiency and speed on resistive memory-based accelerators.
Contribution
It proposes a novel edge compilation and NAS framework that optimizes DNNs for RRAM-based accelerators under hardware constraints, enhancing utilization and speed.
Findings
Over 80% hardware utilization achieved.
Over 6x speedup compared to baseline frameworks.
Speed improvements of 5x-30x for optimized models.
Abstract
Computing-in-memory (CIM) is an emerging computing paradigm, offering noteworthy potential for accelerating neural networks with high parallelism, low latency, and energy efficiency compared to conventional von Neumann architectures. However, existing research has primarily focused on hardware architecture and network co-design for large-scale neural networks, without considering resource constraints. In this study, we aim to develop edge-friendly deep neural networks (DNNs) for accelerators based on resistive random-access memory (RRAM). To achieve this, we propose an edge compilation and resource-constrained RRAM-aware neural architecture search (NAS) framework to search for optimized neural networks meeting specific hardware constraints. Our compilation approach integrates layer partitioning, duplication, and network packing to maximize the utilization of computation units. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Advanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Depthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · ReLU6 · Batch Normalization · Hard Swish · 1x1 Convolution · Dense Connections · Sigmoid Activation
