PIMCOMP: An End-to-End DNN Compiler for Processing-In-Memory Accelerators
Xiaotian Sun, Xinyu Wang, Wanqian Li, Yinhe Han, Xiaoming Chen

TL;DR
PIMCOMP is an end-to-end compiler that enables efficient deployment of deep neural networks on processing-in-memory accelerators, optimizing resource utilization and dataflow scheduling for diverse hardware architectures.
Contribution
It introduces a configurable abstraction and multi-level optimization framework for automatic DNN deployment on PIM accelerators, addressing resource and dataflow challenges.
Findings
Improves throughput, latency, and energy efficiency across various PIM architectures.
Supports flexible convolutional layer partitioning and resource mapping.
Enhances system performance through tailored dataflow scheduling algorithms.
Abstract
Various processing-in-memory (PIM) accelerators based on various devices, micro-architectures, and interfaces have been proposed to accelerate deep neural networks (DNNs). How to deploy DNNs onto PIM-based accelerators is the key to explore PIM's high performance and energy efficiency. The scale of DNN models, the diversity of PIM accelerators, and the complexity of deployment are far beyond the human deployment capability. Hence, an automatic deployment methodology is indispensable. In this work, we propose PIMCOMP, an end-to-end DNN compiler tailored for PIM accelerators, achieving efficient deployment of DNN models on PIM hardware. PIMCOMP can adapt to various PIM architectures by using an abstract configurable PIM accelerator template with a set of pseudo-instructions, which is a high-level abstraction of the hardware's fundamental functionalities. Through a generic multi-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Advanced Memory and Neural Computing
