EGSS: Entropy-guided Stepwise Scaling for Reliable Software Engineering

Chenhui Mao; Yuanting Lei; Zhixiang Wei; Ming Liang; Zhixiang Wang; Jingxuan Xu; Dajun Chen; Wei Jiang; Yong Li

arXiv:2602.05242·cs.SE·May 14, 2026

EGSS: Entropy-guided Stepwise Scaling for Reliable Software Engineering

Chenhui Mao, Yuanting Lei, Zhixiang Wei, Ming Liang, Zhixiang Wang, Jingxuan Xu, Dajun Chen, Wei Jiang, Yong Li

PDF

TL;DR

EGSS introduces an entropy-guided adaptive framework for test-time scaling in software engineering tasks, significantly improving performance and efficiency over existing methods.

Contribution

The paper presents a novel entropy-guided stepwise scaling approach that dynamically balances efficiency and effectiveness in test-time scaling for software engineering models.

Findings

01

EGSS boosts performance by 5-10% across models.

02

Increases resolved ratio of specific benchmarks from 63.2% to 72.2%.

03

Reduces inference token usage by over 28%.

Abstract

Agentic Test-Time Scaling (TTS) has delivered state-of-the-art (SOTA) performance on complex software engineering tasks such as code generation and bug fixing. However, its practical adoption remains limited due to significant computational overhead, primarily driven by two key challenges: (1) the high cost associated with deploying excessively large ensembles, and (2) the lack of a reliable mechanism for selecting the optimal candidate solution, ultimately constraining the performance gains that can be realized. To address these challenges, we propose Entropy-Guided Stepwise Scaling (EGSS), a novel TTS framework that dynamically balances efficiency and effectiveness through entropy-guided adaptive search and robust test-suite augmentation. Extensive experiments on SWE-Bench-Verified demonstrate that EGSS consistently boosts performance by 5-10% across all evaluated models.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.