Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases
Geraldo F. Oliveira, Amirali Boroumand, Saugata Ghose, Juan, G\'omez-Luna, Onur Mutlu

TL;DR
This paper explores co-designing algorithms and hardware accelerators to optimize data-centric architectures, specifically processing-in-memory systems, for machine learning inference on edge devices and hybrid databases in cloud environments.
Contribution
It presents a systematic approach to analyze application patterns and co-design hardware and algorithms to enhance performance and energy efficiency in data-intensive applications.
Findings
Co-designed PIM accelerators improve ML inference efficiency on edge devices.
Optimized architectures reduce energy consumption in cloud database processing.
Application-specific insights guide hardware/software co-design for data-centric systems.
Abstract
Today's computing systems require moving data back-and-forth between computing resources (e.g., CPUs, GPUs, accelerators) and off-chip main memory so that computation can take place on the data. Unfortunately, this data movement is a major bottleneck for system performance and energy consumption. One promising execution paradigm that alleviates the data movement bottleneck in modern and emerging applications is processing-in-memory (PIM), where the cost of data movement to/from main memory is reduced by placing computation capabilities close to memory. Naively employing PIM to accelerate data-intensive workloads can lead to sub-optimal performance due to the many design constraints PIM substrates impose. Therefore, many recent works co-design specialized PIM accelerators and algorithms to improve performance and reduce the energy consumption of (i) applications from various…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Stochastic Gradient Optimization Techniques
