PIMCOMP: An End-to-End DNN Compiler for Processing-In-Memory   Accelerators

Xiaotian Sun; Xinyu Wang; Wanqian Li; Yinhe Han; Xiaoming Chen

arXiv:2411.09159·cs.AR·November 15, 2024

PIMCOMP: An End-to-End DNN Compiler for Processing-In-Memory Accelerators

Xiaotian Sun, Xinyu Wang, Wanqian Li, Yinhe Han, Xiaoming Chen

PDF

Open Access 1 Repo

TL;DR

PIMCOMP is an end-to-end compiler that enables efficient deployment of deep neural networks on processing-in-memory accelerators, optimizing resource utilization and dataflow scheduling for diverse hardware architectures.

Contribution

It introduces a configurable abstraction and multi-level optimization framework for automatic DNN deployment on PIM accelerators, addressing resource and dataflow challenges.

Findings

01

Improves throughput, latency, and energy efficiency across various PIM architectures.

02

Supports flexible convolutional layer partitioning and resource mapping.

03

Enhances system performance through tailored dataflow scheduling algorithms.

Abstract

Various processing-in-memory (PIM) accelerators based on various devices, micro-architectures, and interfaces have been proposed to accelerate deep neural networks (DNNs). How to deploy DNNs onto PIM-based accelerators is the key to explore PIM's high performance and energy efficiency. The scale of DNN models, the diversity of PIM accelerators, and the complexity of deployment are far beyond the human deployment capability. Hence, an automatic deployment methodology is indispensable. In this work, we propose PIMCOMP, an end-to-end DNN compiler tailored for PIM accelerators, achieving efficient deployment of DNN models on PIM hardware. PIMCOMP can adapt to various PIM architectures by using an abstract configurable PIM accelerator template with a set of pseudo-instructions, which is a high-level abstraction of the hardware's fundamental functionalities. Through a generic multi-level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sunxt99/pimcomp-nn
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Advanced Memory and Neural Computing