A Data-driven Analysis of Code Optimizations

Yacine Hakimi; Riyadh Baghdadi

arXiv:2511.06117 · cs.PL · November 11, 2025

Open Access

TL;DR

This paper uses a data-driven approach to analyze how different sequences of automatic code transformations affect performance, aiming to improve compiler optimization strategies efficiently.

Contribution

It introduces a large dataset of randomized program transformations and applies statistical analysis to inform better optimization heuristics.

Findings

01

Predefined fixed sequences can speed up optimization search.

02

Random transformation sequences reveal interaction effects.

03

Data-driven insights guide more efficient optimization algorithms.

Abstract

As the demand for computational power grows, optimizing code through compilers becomes increasingly crucial. In this context, we focus on fully automatic code optimization techniques that automate the process of selecting and applying code transformations for better performance without manual intervention. Understanding how these transformations behave and interact is key to designing more effective optimization strategies. Compiler developers must make numerous design choices when constructing these heuristics. For instance, they may decide whether to allow transformations to be explored in any arbitrary order or to enforce a fixed sequence. While the former may theoretically offer the best performance gains, it significantly increases the search space. This raises an important question: Can a predefined, fixed order of applying transformations speed up the search without severely…
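The search-space claim in the abstract can be made concrete with a small counting sketch. It rests on a simplifying assumption that is not taken from the paper: each of `n` transformations is applied at most once and has no parameters. Under that assumption, a fixed order leaves only the choice of which transformations to include, while an arbitrary order also counts every permutation of each chosen subset. The function names are hypothetical and chosen only for illustration.

```python
from math import factorial


def count_fixed_order(n: int) -> int:
    """Candidate sequences when the order is fixed in advance.

    Assumption (not from the paper): each transformation is either
    included or skipped, so there are 2^n candidates.
    """
    return 2 ** n


def count_arbitrary_order(n: int) -> int:
    """Candidate sequences when any order is allowed.

    Assumption (not from the paper): each transformation is used at most
    once, so we count ordered selections of k distinct transformations
    for every k from 0 to n: sum of n! / (n - k)!.
    """
    return sum(factorial(n) // factorial(n - k) for k in range(n + 1))


if __name__ == "__main__":
    # Compare the two counts for a few illustrative values of n.
    for n in (3, 5, 8):
        print(n, count_fixed_order(n), count_arbitrary_order(n))
```

With 8 transformations this toy model gives 256 fixed-order candidates against 109,601 arbitrary-order candidates. Real compilers also choose transformation parameters and may repeat transformations, so both counts would be larger in practice; the sketch only shows how quickly the gap opens up.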

Taxonomy

Topics: Parallel Computing and Optimization Techniques · Logic, programming, and type systems · Software Engineering Research