PaT: Planning-after-Trial for Efficient Test-Time Code Generation

Youngsik Yoon; Sungjae Lee; Seockbean Song; Siwei Wang; Wei Chen; Jungseul Ok

arXiv:2605.07248·cs.CL·May 11, 2026

PaT: Planning-after-Trial for Efficient Test-Time Code Generation

Youngsik Yoon, Sungjae Lee, Seockbean Song, Siwei Wang, Wei Chen, Jungseul Ok

PDF

TL;DR

PaT introduces an adaptive, cost-efficient approach for test-time code generation that selectively invokes planning only upon verification failure, significantly reducing inference costs while maintaining high performance.

Contribution

It proposes Planning-after-Trial (PaT), a novel adaptive policy that improves efficiency by combining heterogeneous models and selective planning during test-time code generation.

Findings

01

Achieves approximately 69% reduction in inference cost.

02

Significantly improves the cost-performance Pareto frontier.

03

Maintains performance comparable to larger homogeneous models.

Abstract

Beyond training-time optimization, scaling test-time computation has emerged as a key paradigm to extend the reasoning capabilities of Large Language Models (LLMs). However, most existing methods adopt a rigid Planning-before-Trial (PbT) policy, which inefficiently allocates test-time compute by incurring planning overhead even on directly solvable problems. We propose Planning-after-Trial (PaT), an adaptive policy for code generation that invokes a planner only upon verification failure. This adaptive policy naturally enables a heterogeneous model configuration: a cost-efficient model handles generation attempts, while a powerful model is reserved for targeted planning interventions. Empirically, across multiple benchmarks and model families, our approach significantly advances the cost-performance Pareto frontier. Notably, our heterogeneous configuration achieves performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.