CoMoNM: A Cost Modeling Framework for Compute-Near-Memory Systems

Hamid Farzaneh; Asif Ali Khan; Jeronimo Castrillon

arXiv:2508.11451·cs.ET·August 18, 2025

CoMoNM: A Cost Modeling Framework for Compute-Near-Memory Systems

Hamid Farzaneh, Asif Ali Khan, Jeronimo Castrillon

PDF

TL;DR

CoMoNM is a fast, generic cost modeling framework that estimates execution time for compute-near-memory systems, aiding optimization and design decisions with high accuracy and significantly reduced simulation time.

Contribution

It introduces CoMoNM, a novel, hardware-agnostic cost model for CNM systems that enables rapid execution time estimation to improve compiler offloading decisions.

Findings

01

Estimation errors within 7.80% and 2.99% compared to real systems.

02

Provides execution time estimates seven orders of magnitude faster than detailed simulators.

03

Seamlessly integrates into CNM compilers for improved optimization.

Abstract

Compute-Near-Memory (CNM) systems offer a promising approach to mitigate the von Neumann bottleneck by bringing computational units closer to data. However, optimizing for these architectures remains challenging due to their unique hardware and programming models. Existing CNM compilers often rely on manual programmer annotations for offloading and optimizations. Automating these decisions by exploring the optimization space, common in CPU/GPU systems, is difficult for CNMs as constructing and navigating the transformation space is tedious and time consuming. This is particularly the case during system-level design, where evaluation requires time-consuming simulations. To address this, we present CoMoNM, a generic cost modeling framework for CNM systems for execution time estimation in milliseconds. It takes a high-level, hardware-agnostic application representation, target system…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.