EPIM: Efficient Processing-In-Memory Accelerators based on Epitome
Chenyu Wang, Zhen Dong, Daquan Zhou, Zhenhua Zhu, Yu Wang, Jiashi, Feng, Kurt Keutzer

TL;DR
This paper introduces EPIM, a novel neural operator and design methodology for PIM accelerators that improves memory efficiency and reduces area, enabling effective large-scale neural network deployment.
Contribution
The paper presents Epitome, a lightweight neural operator tailored for PIM, along with software and hardware optimizations, to enhance neural network processing efficiency on PIM accelerators.
Findings
3-bit quantized EPIM-ResNet50 achieves 71.59% top-1 accuracy on ImageNet.
EPIM reduces crossbar areas by 30.65 times.
EPIM outperforms state-of-the-art pruning methods on PIM.
Abstract
The utilization of large-scale neural networks on Processing-In-Memory (PIM) accelerators encounters challenges due to constrained on-chip memory capacity. To tackle this issue, current works explore model compression algorithms to reduce the size of Convolutional Neural Networks (CNNs). Most of these algorithms either aim to represent neural operators with reduced-size parameters (e.g., quantization) or search for the best combinations of neural operators (e.g., neural architecture search). Designing neural operators to align with PIM accelerators' specifications is an area that warrants further study. In this paper, we introduce the Epitome, a lightweight neural operator offering convolution-like functionality, to craft memory-efficient CNN operators for PIM accelerators (EPIM). On the software side, we evaluate epitomes' latency and energy on PIM accelerators and introduce a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovative Microfluidic and Catalytic Techniques Innovation · Advanced Memory and Neural Computing · Machine Learning in Bioinformatics
MethodsPruning · ALIGN
