A Method for Hiding the Increased Non-Volatile Cache Read Latency
Apostolos Kokolis, Namrata Mantri, Shrikanth Ganapathy, Josep Torrellas, John Kalamatianos

TL;DR
This paper introduces Cloak, a method that leverages data reuse at the page level to hide the high read latency of NVM-based last level caches, improving performance with minimal area overhead.
Contribution
Cloak is a novel cache design that uses page-level data transfer to mitigate NVM read latency, outperforming traditional SRAM and NVM-only caches.
Findings
Cloak improves performance by 23.8% over an SRAM LLC.
Cloak reduces energy-delay product by 39.9%.
Cloak achieves these gains with negligible area increase.
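For readers unfamiliar with the metric, energy-delay product (EDP) is energy consumed multiplied by execution time, so a relative EDP change is the product of the relative energy and delay changes. The sketch below illustrates only that arithmetic. The function names and input ratios are hypothetical and are not taken from the paper.

```python
def edp(energy_joules: float, delay_seconds: float) -> float:
    """Energy-delay product: energy consumed times execution time."""
    return energy_joules * delay_seconds


def relative_edp_reduction(energy_ratio: float, delay_ratio: float) -> float:
    """Fractional EDP reduction, given new/baseline ratios for energy and delay."""
    return 1.0 - energy_ratio * delay_ratio


# Hypothetical inputs, NOT results from the paper:
# a design that uses 10% less energy and runs 1.2x faster (delay ratio 1/1.2).
reduction = relative_edp_reduction(energy_ratio=0.9, delay_ratio=1 / 1.2)
print(f"EDP reduction: {reduction:.1%}")
```

Because EDP multiplies the two factors, a speedup lowers EDP even when energy is unchanged, which is why the metric is commonly reported alongside raw performance.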
Abstract
The increased memory demands of workloads are putting high pressure on Last Level Caches (LLCs). Unfortunately, there is limited opportunity to increase the capacity of LLCs due to the area and power requirements of the underlying SRAM technology. Interestingly, emerging Non-Volatile Memory (NVM) technologies promise a feasible alternative to SRAM for LLCs due to their higher area density. However, NVMs have substantially higher read and write latencies, which offset their area density benefit. Although researchers have proposed methods to tolerate NVM's increased write latency, little emphasis has been placed on reducing the critical NVM read latency. To address this problem, this paper proposes Cloak. Cloak exploits data reuse in the LLC at the page level to hide NVM read latency. Specifically, on certain L1 TLB misses to a page, Cloak transfers LLC-resident data belonging to the…
Taxonomy
Topics: Advanced Data Storage Technologies · Parallel Computing and Optimization Techniques · Caching and Content Delivery
