Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning

Haolin Pan; Hongyu Lin; Haoran Luo; Yang Liu; Kaichun Yao; Libo Zhang; Mingjie Xing; Yanjun Wu

arXiv:2506.15701·cs.LG·June 23, 2025

Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning

Haolin Pan, Hongyu Lin, Haoran Luo, Yang Liu, Kaichun Yao, Libo Zhang, Mingjie Xing, Yanjun Wu

PDF

Open Access

TL;DR

Compiler-R1 introduces a reinforcement learning framework that enhances large language models for compiler auto-tuning, achieving significant IR instruction count reductions through a curated dataset and novel training pipeline.

Contribution

It presents the first RL-driven approach that augments LLMs for compiler auto-tuning, addressing dataset and interaction challenges with a two-stage training pipeline.

Findings

01

Achieved an average 8.46% IR instruction count reduction.

02

Demonstrated effectiveness across seven datasets.

03

Showcases potential of RL-trained LLMs in compiler optimization.

Abstract

Compiler auto-tuning optimizes pass sequences to improve performance metrics such as Intermediate Representation (IR) instruction count. Although recent advances leveraging Large Language Models (LLMs) have shown promise in automating compiler tuning, two significant challenges still remain: the absence of high-quality reasoning datasets for agents training, and limited effective interactions with the compilation environment. In this work, we introduce Compiler-R1, the first reinforcement learning (RL)-driven framework specifically augmenting LLM capabilities for compiler auto-tuning. Compiler-R1 features a curated, high-quality reasoning dataset and a novel two-stage end-to-end RL training pipeline, enabling efficient environment exploration and learning through an outcome-based reward. Extensive experiments across seven datasets demonstrate Compiler-R1 achieving an average 8.46% IR…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Multimodal Machine Learning Applications · Natural Language Processing Techniques