Fine-Grained Address Segmentation for Attention-Based Variable-Degree   Prefetching

Pengmiao Zhang; Ajitesh Srivastava; Anant V. Nori; Rajgopal Kannan,; Viktor K. Prasanna

arXiv:2205.02269·cs.AR·May 6, 2022

Fine-Grained Address Segmentation for Attention-Based Variable-Degree Prefetching

Pengmiao Zhang, Ajitesh Srivastava, Anant V. Nori, Rajgopal Kannan,, Viktor K. Prasanna

PDF

1 Repo

TL;DR

This paper introduces TransFetch, an attention-based model utilizing fine-grained address segmentation and delta bitmaps to improve memory prefetching accuracy and performance, significantly outperforming existing methods.

Contribution

The paper proposes a novel prefetching model, TransFetch, that effectively models address prediction using address segmentation and attention mechanisms, addressing large vocabulary issues.

Findings

01

Address segmentation improves F1-score by up to 36%.

02

TransFetch achieves 38.75% IPC improvement over no prefetching.

03

Outperforms rule-based BOP and ML-based Voyager prefetchers by over 6%.

Abstract

Machine learning algorithms have shown potential to improve prefetching performance by accurately predicting future memory accesses. Existing approaches are based on the modeling of text prediction, considering prefetching as a classification problem for sequence prediction. However, the vast and sparse memory address space leads to large vocabulary, which makes this modeling impractical. The number and order of outputs for multiple cache line prefetching are also fundamentally different from text prediction. We propose TransFetch, a novel way to model prefetching. To reduce vocabulary size, we use fine-grained address segmentation as input. To predict unordered sets of future addresses, we use delta bitmaps for multiple outputs. We apply an attention-based network to learn the mapping between input and output. Prediction experiments demonstrate that address segmentation achieves 26% -…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pgroupatusc/transfetch
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.