Synthesizing Performance Constraints for Evaluating and Improving Code Efficiency

Jun Yang; Cheng-Chi Wang; Bogdan Alexandru Stoica; Kexin Pei

arXiv:2505.23471·cs.SE·December 8, 2025

Synthesizing Performance Constraints for Evaluating and Improving Code Efficiency

Jun Yang, Cheng-Chi Wang, Bogdan Alexandru Stoica, Kexin Pei

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces WEDGE, a framework that synthesizes performance constraints to generate stress tests for code, enabling better evaluation and optimization of code efficiency, and releases PERFFORGE as a benchmark suite.

Contribution

WEDGE is a novel framework that synthesizes explicit performance constraints to generate targeted stress tests for code, improving evaluation and optimization processes.

Findings

01

WEDGE significantly slows down code compared to existing tests.

02

Integrating WEDGE-generated tests enhances code optimization approaches.

03

PERFFORGE provides a new benchmark for evaluating code efficiency.

Abstract

Large Language Models (LLMs) have been increasingly used to optimize code efficiency. Evaluating their effectiveness and further suggesting optimization opportunities often rely on high-quality tests to demonstrate the performance bottlenecks presented in the program. However, existing approaches rely on a limited set of hand-curated inputs or LLM-generated uninteresting length-stressing tests, failing to reveal more nuanced optimization opportunities. We present WEDGE, a framework for generating performance-stressing input given the program under test. WEDGE synthesizes explicit performance-characterizing constraints in the form of branch conditions to partition the programs' execution space into performance-specific regions. When integrated with the coverage-guided fuzzer, reaching different regions introduces explicit rewards for test generation to explore inefficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uchiseclab/perfforge
noneOfficial

Videos

Synthesizing Performance Constraints for Evaluating and Improving Code Efficiency· slideslive

Taxonomy

TopicsSoftware System Performance and Reliability · Software Engineering Research · Software Testing and Debugging Techniques