Machine Learning Training on a Real Processing-in-Memory System
Juan G\'omez-Luna, Yuxin Guo, Sylvan Brocard, Julien Legriel, Remy, Cimadomo, Geraldo F. Oliveira, Gagandeep Singh, Onur Mutlu

TL;DR
This paper evaluates the potential of general-purpose processing-in-memory (PIM) systems to accelerate machine learning training, demonstrating significant performance improvements over traditional CPU and GPU implementations on real hardware.
Contribution
It is the first to implement and evaluate training of multiple machine learning algorithms on a real-world general-purpose PIM architecture, highlighting its advantages.
Findings
PIM architectures can greatly accelerate memory-bound ML workloads
Native support of operations and datatypes in PIM hardware is crucial
Experimental results show significant performance gains over CPU and GPU
Abstract
Training machine learning algorithms is a computationally intensive process, which is frequently memory-bound due to repeatedly accessing large training datasets. As a result, processor-centric systems (e.g., CPU, GPU) suffer from costly data movement between memory units and processing units, which consumes large amounts of energy and execution cycles. Memory-centric computing systems, i.e., computing systems with processing-in-memory (PIM) capabilities, can alleviate this data movement bottleneck. Our goal is to understand the potential of modern general-purpose PIM architectures to accelerate machine learning training. To do so, we (1) implement several representative classic machine learning algorithms (namely, linear regression, logistic regression, decision tree, K-means clustering) on a real-world general-purpose PIM architecture, (2) characterize them in terms of accuracy,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing · Machine Learning in Materials Science
