Five-Minute Rule 40 Years Later: A First-Principles Revisit for Modern Memory Hierarchy
Tong Zhang, Vikram Sharma Mailthody, Fei Sun, Linsen Ma, Chris J. Newburn, Teresa Zhang, Yang Liu, Jiangpeng Li, Hao Zhong, Wen-Mei Hwu

TL;DR
This paper revisits the five-minute rule for memory hierarchy, integrating modern hardware costs and performance models to provide actionable guidance for AI platforms, especially with GPU and SSD advancements.
Contribution
It extends the classical heuristic into a first-principles, workload-aware framework tailored for modern AI hardware and introduces MQSim-Next, a new SSD simulation tool.
Findings
DRAM-SSD caching threshold drops from minutes to seconds for AI workloads.
NAND flash memory acts as an active data tier in modern systems.
The framework enables new hardware-software co-design research avenues.
Abstract
In 1987, Jim Gray and Gianfranco Putzolu introduced the five-minute rule, a simple, storage-memory-economics-based heuristic for deciding when data should live in DRAM rather than on storage. Subsequent revisits to the rule largely retained that economics-only view, leaving host costs, feasibility limits, and workload behavior out of scope. This paper revisits the rule from first principles, integrating host costs, DRAM bandwidth/capacity, and physics-grounded models of SSD performance and cost, and then embedding these elements in a constraint- and workload-aware framework that yields actionable provisioning guidance. We show that, for modern AI platforms, especially GPU-centric hosts paired with ultra-high-IOPS SSDs engineered for fine-grained random access, the DRAMflash caching threshold collapses from minutes to a few seconds. This shift reframes NAND flash memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
