PEAK: A Performance Engineering AI-Assistant for GPU Kernels Powered by Natural Language Transformations
Muhammad Usman Tariq, Abhinav Jangda, Angelica Moreira, Madan Musuvathi, Tyler Sorensen

TL;DR
PEAK is an AI-assisted framework that uses natural language transformations and large language models to optimize GPU kernel code across different backends, achieving performance comparable to vendor libraries.
Contribution
This work introduces PEAK, a novel system leveraging natural language transformations and LLMs for GPU kernel optimization, adaptable to multiple backends and kernels.
Findings
PEAK's implementations match hardware FLOPS for HLSL kernels.
Optimizations are competitive with vendor libraries.
The system enables detailed analysis of LLM behavior in code transformation.
Abstract
Advancements in large language models (LLMs) are showing promising impact in software development and programming assistance. However, these models struggle when operating on low-level backend code. This challenge is exacerbated in the domain of GPU kernels, where performance-critical details are coupled to rapidly evolving hardware characteristics and available code examples are sparse. In this work, we introduce PEAK, a Performance Engineering AI-Assistant for GPU Kernels powered by natural language transformations. PEAK utilizes the key insight that iterative code transformations (optimizations) can straightforwardly be written in natural language, and then carried out by LLMs. Thus, these transformations can be rapidly developed, encoding general portable optimizations, but also easily specialized to specific GPU devices and even kernels. These natural transformations are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Big Data and Digital Economy · Cloud Computing and Resource Management
