PerfRL: A Small Language Model Framework for Efficient Code Optimization
Shukai Duan, Nikos Kanakaris, Xiongye Xiao, Heng Ping, Chenyu Zhou,, Nesreen K. Ahmed, Guixiang Ma, Mihai Capota, Theodore L. Willke, Shahin, Nazarian, Paul Bogdan

TL;DR
PerfRL introduces a framework combining small language models and reinforcement learning to automate code optimization, achieving faster results with less computational resources and fewer errors.
Contribution
This paper presents PerfRL, a novel framework that enhances code optimization efficiency using small language models and reinforcement learning with environment feedback.
Findings
PerfRL outperforms existing models in speed and resource efficiency.
It reduces logical and syntactical errors in code optimization.
Achieves comparable or better optimization results with shorter training times.
Abstract
Code optimization is a challenging task requiring a substantial level of expertise from developers. Nonetheless, this level of human capacity is not sufficient considering the rapid evolution of new hardware architectures and software environments. In light of this, recent research proposes adopting machine learning and artificial intelligence techniques to automate the code optimization process. In this paper, we introduce PerfRL, an innovative framework designed to tackle the problem of code optimization. Our framework leverages the capabilities of small language models (SLMs) and reinforcement learning (RL), facilitating a system where SLMs can assimilate feedback from their environment during the fine-tuning phase, notably through unit tests. When benchmarked against existing models, PerfRL demonstrates superior efficiency in terms of speed and computational resource usage,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Ferroelectric and Negative Capacitance Devices · Software Testing and Debugging Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Gated Linear Unit · Byte Pair Encoding · Softmax · Residual Connection · Layer Normalization · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia?
