Bayesian Optimization for auto-tuning GPU kernels

Floris-Jan Willemsen; Rob van Nieuwpoort; Ben van Werkhoven

arXiv:2111.14991·cs.LG·December 1, 2021

Bayesian Optimization for auto-tuning GPU kernels

Floris-Jan Willemsen, Rob van Nieuwpoort, Ben van Werkhoven

PDF

2 Repos

TL;DR

This paper applies Bayesian Optimization to efficiently find optimal configurations for GPU kernels, addressing challenges of discrete, constrained, and invalid configurations, and demonstrating superior performance over existing methods.

Contribution

It introduces novel techniques for Bayesian Optimization tailored to GPU kernel tuning, including a contextual variance exploration factor and scalable acquisition functions.

Findings

01

Our method outperforms existing search strategies significantly.

02

The approach generalizes well across different test cases.

03

It effectively handles complex, constrained search spaces.

Abstract

Finding optimal parameter configurations for tunable GPU kernels is a non-trivial exercise for large search spaces, even when automated. This poses an optimization task on a non-convex search space, using an expensive to evaluate function with unknown derivative. These characteristics make a good candidate for Bayesian Optimization, which has not been applied to this problem before. However, the application of Bayesian Optimization to this problem is challenging. We demonstrate how to deal with the rough, discrete, constrained search spaces, containing invalid configurations. We introduce a novel contextual variance exploration factor, as well as new acquisition functions with improved scalability, combined with an informed acquisition function selection mechanism. By comparing the performance of our Bayesian Optimization implementation on various test cases to the existing search…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.