Arsenal of Hardware Prefetchers
Dishank Yadav, Chaitanya Paikara

TL;DR
Arsenal is a dynamic prefetching framework that combines multiple hardware prefetchers, selecting the most effective one in real-time to improve latency tolerance across diverse workloads.
Contribution
It introduces a novel framework that adaptively switches between different prefetchers using Bloom filters, enhancing versatility and performance over individual prefetchers.
Findings
Achieves 44.29% speedup on single-core workloads.
Provides 19.5% average improvement on multi-core workloads.
Outperforms individual prefetchers in diverse access patterns.
Abstract
Hardware prefetching is one of the latency tolerance optimization techniques that tolerate costly DRAM accesses. Though hardware prefetching is one of the fundamental mechanisms prevalent on most of the commercial machines, there is no prefetching technique that works well across all the access patterns and different types of workloads. Through this paper, we propose Arsenal, a prefetching framework which allows the advantages provided by different data prefetchers to be combined, by dynamically selecting the best-suited prefetcher for the current workload. Thus effectively improving the versatility of the prefetching system. It bases on the classic Sandbox prefetcher that dynamically adapts and utilizes multiple offsets for sequential prefetchers. We take it to the next step by switching between prefetchers like Multi look Ahead Offset Prefetching and Timing SKID Prefetcher on the run.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Embedded Systems Design Techniques · Interconnection Networks and Systems
