Proxics: an efficient programming model for far memory accelerators
Zikai Liu, Niels Pressel, Jasmin Schult, Roman Meier, Pengcheng Xu, Timothy Roscoe

TL;DR
Proxics introduces a lightweight, portable programming model for near-data processing accelerators based on OS abstractions, optimized for hardware with limited processing power and designed to reduce memory bandwidth.
Contribution
It proposes a novel, efficient implementation of OS-like abstractions for NDP accelerators, enabling portable programming and emphasizing low-latency communication.
Findings
Demonstrated benefits over CPU-only implementations.
Showed effectiveness with various memory access patterns.
Highlighted the importance of efficient communication channels.
Abstract
The use of disaggregated or far memory systems such as CXL memory pools has renewed interest in Near-Data Processing (NDP): situating cores close to memory to reduce bandwidth requirements to and from the CPU. Hardware designs for such accelerators are appearing, but there lack clean, portable OS abstractions for programming them. We propose a programming model for NDP devices based on familiar OS abstractions: virtual processors (processes) and inter-process communication channels (like Unix pipes). While appealing from a user perspective, a naive implementation of such abstractions is inappropriate for NDP accelerators: the paucity of processing power in some hardware designs makes classical processes overly heavyweight, and IPC based on shared buffers makes no sense in a system designed to reduce memory bandwidth. Accordingly, we show how to implement these abstractions in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
