A Study of Runtime Adaptive Prefetching for STTRAM L1 Caches

Kyle Kuan; Tosiron Adegbija

arXiv:2009.11442·cs.AR·September 25, 2020

A Study of Runtime Adaptive Prefetching for STTRAM L1 Caches

Kyle Kuan, Tosiron Adegbija

PDF

Open Access

TL;DR

This paper investigates runtime adaptive prefetching strategies for STTRAM L1 caches, demonstrating significant energy and latency reductions through retention time tuning and prefetch control based on application behavior.

Contribution

It introduces novel metrics and methods for adaptive retention time and prefetch control in STTRAM caches, improving energy efficiency and latency performance.

Findings

01

Reduced cache energy by 22.24% with proposed methods.

02

Lowered cache latency by 24.59% using adaptive techniques.

03

Achieved over 54% hardware overhead reduction with combined strategies.

Abstract

Spin-Transfer Torque RAM (STTRAM) is a promising alternative to SRAM in on-chip caches due to several advantages. These advantages include non-volatility, low leakage, high integration density, and CMOS compatibility. Prior studies have shown that relaxing and adapting the STTRAM retention time to runtime application needs can substantially reduce overall cache energy without significant latency overheads, due to the lower STTRAM write energy and latency in shorter retention times. In this paper, as a first step towards efficient prefetching across the STTRAM cache hierarchy, we study prefetching in reduced retention STTRAM L1 caches. Using SPEC CPU 2017 benchmarks, we analyze the energy and latency impact of different prefetch distances in different STTRAM cache retention times for different applications. We show that expired_unused_prefetches---the number of unused prefetches expired…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Advanced Memory and Neural Computing