Synthesizing Performance Constraints for Evaluating and Improving Code Efficiency
Jun Yang, Cheng-Chi Wang, Bogdan Alexandru Stoica, Kexin Pei

TL;DR
This paper introduces WEDGE, a framework that synthesizes performance constraints to generate stress tests for code, enabling better evaluation and optimization of code efficiency, and releases PERFFORGE as a benchmark suite.
Contribution
WEDGE is a novel framework that synthesizes explicit performance constraints to generate targeted stress tests for code, improving evaluation and optimization processes.
Findings
WEDGE significantly slows down code compared to existing tests.
Integrating WEDGE-generated tests enhances code optimization approaches.
PERFFORGE provides a new benchmark for evaluating code efficiency.
Abstract
Large Language Models (LLMs) have been increasingly used to optimize code efficiency. Evaluating their effectiveness and further suggesting optimization opportunities often rely on high-quality tests to demonstrate the performance bottlenecks presented in the program. However, existing approaches rely on a limited set of hand-curated inputs or LLM-generated uninteresting length-stressing tests, failing to reveal more nuanced optimization opportunities. We present WEDGE, a framework for generating performance-stressing input given the program under test. WEDGE synthesizes explicit performance-characterizing constraints in the form of branch conditions to partition the programs' execution space into performance-specific regions. When integrated with the coverage-guided fuzzer, reaching different regions introduces explicit rewards for test generation to explore inefficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsSoftware System Performance and Reliability · Software Engineering Research · Software Testing and Debugging Techniques
