A Data-driven Analysis of Code Optimizations
Yacine Hakimi, Riyadh Baghdadi

TL;DR
This paper uses a data-driven approach to analyze how different sequences of automatic code transformations affect performance, aiming to improve compiler optimization strategies efficiently.
Contribution
It introduces a large dataset of randomized program transformations and applies statistical analysis to inform better optimization heuristics.
Findings
Predefined fixed sequences can speed up optimization search.
Random transformation sequences reveal interaction effects.
Data-driven insights guide more efficient optimization algorithms.
Abstract
As the demand for computational power grows, optimizing code through compilers becomes increasingly crucial. In this context, we focus on fully automatic code optimization techniques that automate the process of selecting and applying code transformations for better performance without manual intervention. Understanding how these transformations behave and interact is key to designing more effective optimization strategies. Compiler developers must make numerous design choices when constructing these heuristics. For instance, they may decide whether to allow transformations to be explored in any arbitrary order or to enforce a fixed sequence. While the former may theoretically offer the best performance gains, it significantly increases the search space. This raises an important question: Can a predefined, fixed order of applying transformations speed up the search without severely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Logic, programming, and type systems · Software Engineering Research
