Understanding and Improving the Latency of DRAM-Based Memory Systems
Kevin K. Chang

TL;DR
This paper investigates the persistent latency issues in DRAM-based memory systems, proposing new architectural techniques and mechanisms to reduce latency and improve energy efficiency, supported by experimental data on real chips.
Contribution
It introduces novel low-cost architectural techniques and explores the voltage-latency trade-off to enhance DRAM performance and energy efficiency.
Findings
Significant latency reduction achieved through new techniques
Improved energy efficiency via voltage-latency trade-off
Experimental validation on real commodity DRAM chips
Abstract
Over the past two decades, the storage capacity and access bandwidth of main memory have improved tremendously, by 128x and 20x, respectively. These improvements are mainly due to the continuous technology scaling of DRAM (dynamic random-access memory), which has been used as the physical substrate for main memory. In stark contrast with capacity and bandwidth, DRAM latency has remained almost constant, reducing by only 1.3x in the same time frame. Therefore, long DRAM latency continues to be a critical performance bottleneck in modern systems. Increasing core counts, and the emergence of increasingly more data-intensive and latency-critical applications further stress the importance of providing low-latency memory access. In this dissertation, we identify three main problems that contribute significantly to long latency of DRAM accesses. To address these problems, we present a series…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Low-power high-performance VLSI design · Advanced Memory and Neural Computing
