Reducing Load Latency with Cache Level Prediction

Majid Jalili; Mattan Erez

arXiv:2103.14808·cs.AR·March 30, 2021

TL;DR

This paper introduces a cache-level prediction method that accurately forecasts memory hierarchy levels accessed by loads, enabling earlier data fetching and reducing load latency, resulting in notable performance improvements.

Contribution

It presents a novel cache-level predictor that complements prefetchers by accurately predicting memory hierarchy levels accessed, improving load latency and overall performance.

Findings

01

Achieves a 7.8% speedup on generic, graph, and HPC applications over a baseline with aggressive prefetchers.

02

Provides high prediction accuracy at the cost of one cycle of added latency on L1 misses.

03

Effectively reduces load latency in deep cache hierarchies.

Abstract

High load latency that results from deep cache hierarchies and relatively slow main memory is an important limiter of single-thread performance. Data prefetch helps reduce this latency by fetching data up the hierarchy before it is requested by load instructions. However, data prefetching has been shown to be imperfect in many situations. We propose cache-level prediction to complement prefetchers. Our method predicts which memory hierarchy level a load will access, allowing the memory loads to start earlier and thereby saving many cycles. The predictor provides high prediction accuracy at the cost of just one cycle of added latency to L1 misses. Experimental results show a speedup of 7.8% on generic, graph, and HPC applications over a baseline with aggressive prefetchers.
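To make the idea of predicting the serving level concrete, here is a minimal toy sketch in Python. It is not the paper's predictor: the abstract does not describe the predictor's structure, so the per-PC history table, the class name `ToyLevelPredictor`, the helper `measure_accuracy`, and the level names are all assumptions chosen for illustration. The sketch guesses that a load will be served by whichever level served the same load instruction most often in the past.

```python
from collections import Counter, defaultdict

# Hypothetical level names; a real hierarchy may differ.
LEVELS = ("L1", "L2", "L3", "DRAM")


class ToyLevelPredictor:
    """Illustrative only: predicts the level that most often served a given load PC."""

    def __init__(self):
        # Maps a load's program counter to counts of the levels that served it.
        self.history = defaultdict(Counter)

    def predict(self, pc):
        counts = self.history[pc]
        if not counts:
            return "L1"  # Assumed default when no history exists for this PC.
        return counts.most_common(1)[0][0]

    def update(self, pc, actual_level):
        if actual_level not in LEVELS:
            raise ValueError(f"unknown level: {actual_level}")
        self.history[pc][actual_level] += 1


def measure_accuracy(predictor, trace):
    """Replay (pc, actual_level) pairs; return the fraction predicted correctly."""
    correct = 0
    for pc, actual_level in trace:
        if predictor.predict(pc) == actual_level:
            correct += 1
        predictor.update(pc, actual_level)
    return correct / len(trace) if trace else 0.0


if __name__ == "__main__":
    # Synthetic trace: one load PC that always misses to DRAM, one that hits in L2.
    trace = [(0x400, "DRAM")] * 3 + [(0x500, "L2")] * 2
    print(measure_accuracy(ToyLevelPredictor(), trace))
```

In this toy model, the first access from each load PC is mispredicted because there is no history yet, and later accesses are predicted correctly as long as the load keeps hitting the same level. A hardware design would additionally have to bound table size and handle mispredictions, which this sketch does not attempt.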

Code & Models

Repositories

cmu-safari/hermes

Taxonomy

Topics: Parallel Computing and Optimization Techniques · Cloud Computing and Resource Management · Advanced Data Storage Technologies