Cache Bypassing for Machine Learning Algorithms

Asim Ikram; Muhammad Awais Ali; Mirza Omer Beg

arXiv:2102.06892·cs.AR·February 16, 2021

Cache Bypassing for Machine Learning Algorithms

Asim Ikram, Muhammad Awais Ali, Mirza Omer Beg

PDF

Open Access

TL;DR

This paper introduces a predictive cache bypassing model for GPU-based machine learning algorithms, significantly improving cache efficiency and accuracy in data access pattern analysis.

Contribution

It proposes a novel predictive model that determines cache storage decisions for machine learning workloads on GPUs, enhancing performance and cache utilization.

Findings

01

Model achieves around 90% accuracy on various datasets.

02

KNN performs best among tested models.

03

Splitting addresses into chunks improves neural network accuracy to 99.9%.

Abstract

Graphics Processing Units (GPUs) were once used solely for graphical computation tasks but with the increase in the use of machine learning applications, the use of GPUs to perform general-purpose computing has increased in the last few years. GPUs employ a massive amount of threads, that in turn achieve a high amount of parallelism, to perform tasks. Though GPUs have a high amount of computation power, they face the problem of cache contention due to the SIMT model that they use. A solution to this problem is called "cache bypassing". This paper presents a predictive model that analyzes the access patterns of various machine learning algorithms and determines whether certain data should be stored in the cache or not. It presents insights on how well each model performs on different datasets and also shows how minimizing the size of each model will affect its performance The performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · Advanced Data Storage Technologies · Parallel Computing and Optimization Techniques