Fast-OverlaPIM: A Fast Overlap-driven Mapping Framework for Processing In-Memory Neural Network Acceleration
Xuan Wang, Minxuan Zhou, Tajana Rosing

TL;DR
Fast-OverlaPIM introduces a novel framework for optimizing in-memory neural network acceleration by leveraging overlapping execution of layers, significantly improving mapping speed and performance over previous methods.
Contribution
It presents analytical algorithms and a new search strategy for efficient overlap-based mapping exploration in PIM neural network accelerators.
Findings
Achieves up to 323.1x faster runtime performance than previous frameworks.
Produces mappings 4.6x to 18.1x faster than state-of-the-art optimization methods.
Demonstrates significant improvements in in-memory NN acceleration efficiency.
Abstract
Processing in-memory (PIM) is promising to accelerate neural networks (NNs) because it minimizes data movement and provides large computational parallelism. Similar to machine learning accelerators, application mapping, which determines the operation scheduling and data layout, plays a critical role in the NN acceleration on PIM. The mapping optimization of previous NN accelerators focused on optimizing the latency of sequential execution. However, PIM accelerators feature a distinct design space of application mapping from conventional NN accelerators, due to the spatial execution of NN layers across different memory locations. This enables opportunities for overlapping execution of consecutive NN layers to improve the latency, where the succeeding layer can start execution before the preceding layer fully completes the computation. In this paper, we propose Fast-OverlaPIM framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Brain Tumor Detection and Classification · Advanced Neural Network Applications
