Gemini: Reducing DRAM Cache Hit Latency by Hybrid Mappings
Ye Chi

TL;DR
Gemini is a hybrid DRAM cache design that combines static and dynamic mappings to reduce hit latency and maintain high hit rates, outperforming existing cache architectures.
Contribution
The paper introduces Gemini, a novel hybrid mapping DRAM cache that classifies data blocks and applies different mappings, achieving both low latency and high hit rate.
Findings
Reduces DRAM cache hit latency from 1.75X to 1.22X of direct-mapped cache.
Achieves comparable hit rate to set-associative cache.
Improves IPC by up to 20% over state-of-the-art baselines.
Abstract
Die-stacked DRAM caches are increasingly advocated to bridge the performance gap between on-chip Cache and main memory. It is essential to improve DRAM cache hit rate and lower cache hit latency simultaneously. Prior DRAM cache designs fall into two categories according to the data mapping polices: set-associative and direct-mapped, achieving either one. In this paper, we propose a partial direct-mapped die-stacked DRAM cache to achieve the both objectives simultaneously, called Gemini, which is motivated by the following observations: applying unified mapping policy to different blocks cannot achieve high cache hit rate and low hit latency in terms of mapping structure. Gemini cache classifies data into leading blocks and following blocks, and places them with static mapping and dynamic mapping respectively in a unified set-associative structure. Gemini also designs a replacement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Interconnection Networks and Systems
