CoMoNM: A Cost Modeling Framework for Compute-Near-Memory Systems
Hamid Farzaneh, Asif Ali Khan, Jeronimo Castrillon

TL;DR
CoMoNM is a fast, generic cost modeling framework that estimates execution time for compute-near-memory systems, aiding optimization and design decisions with high accuracy and significantly reduced simulation time.
Contribution
It introduces CoMoNM, a novel, hardware-agnostic cost model for CNM systems that enables rapid execution time estimation to improve compiler offloading decisions.
Findings
Estimation errors within 7.80% and 2.99% compared to real systems.
Provides execution time estimates seven orders of magnitude faster than detailed simulators.
Seamlessly integrates into CNM compilers for improved optimization.
Abstract
Compute-Near-Memory (CNM) systems offer a promising approach to mitigate the von Neumann bottleneck by bringing computational units closer to data. However, optimizing for these architectures remains challenging due to their unique hardware and programming models. Existing CNM compilers often rely on manual programmer annotations for offloading and optimizations. Automating these decisions by exploring the optimization space, common in CPU/GPU systems, is difficult for CNMs as constructing and navigating the transformation space is tedious and time consuming. This is particularly the case during system-level design, where evaluation requires time-consuming simulations. To address this, we present CoMoNM, a generic cost modeling framework for CNM systems for execution time estimation in milliseconds. It takes a high-level, hardware-agnostic application representation, target system…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
