Continual Learning Approach for Improving the Data and Computation Mapping in Near-Memory Processing System
Pritam Majumder, Jiayi Huang, Sungkeun Kim, Abdullah Muzahid, Dylan, Siegers, Chia-Che Tsai, and Eun Jung Kim

TL;DR
This paper introduces AIMM, an AI-driven adaptive memory mapping scheme for near-memory processing systems that learns optimal data placement to significantly enhance system performance.
Contribution
AIMM is a novel reinforcement learning-based memory mapping approach that dynamically optimizes data placement in NMP systems, addressing application-specific challenges.
Findings
AIMM improves NMP performance by up to 70% in single-program scenarios.
AIMM enhances performance by up to 50% in multi-program scenarios.
The proposed hardware module can be integrated into existing NMP systems.
Abstract
The resurgence of near-memory processing (NMP) with the advent of big data has shifted the computation paradigm from processor-centric to memory-centric computing. To meet the bandwidth and capacity demands of memory-centric computing, 3D memory has been adopted to form a scalable memory-cube network. Along with NMP and memory system development, the mapping for placing data and guiding computation in the memory-cube network has become crucial in driving the performance improvement in NMP. However, it is very challenging to design a universal optimal mapping for all applications due to unique application behavior and intractable decision space. In this paper, we propose an artificially intelligent memory mapping scheme, AIMM, that optimizes data placement and resource utilization through page and computation remapping. Our proposed technique involves continuously evaluating and learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Network Packet Processing and Optimization · Neural Networks and Applications
