TL;DR
This paper introduces Property-Generated Solver (PGS), a feedback mechanism for LLM code refinement that emphasizes high-quality, property-oriented, and minimal feedback to improve correctness and fix rates.
Contribution
The paper proposes PGS, a novel feedback paradigm that enhances LLM code refinement by focusing on semantic properties and minimal counterexamples, outperforming existing methods.
Findings
PGS improves pass@1 by up to 13.4% over other TDD methods.
Achieves over 64% fix rate on initially failed problems.
Demonstrates a bug fix rate 1.4x-1.6x higher than previous approaches.
Abstract
LLMs excel at code generation, yet ensuring the functional correctness of their outputs remains a persistent challenge. While recent studies have applied Test-Driven Development (TDD) to refine code, these methods are often undermined by poor feedback quality, stemming from the scarcity of high-quality test cases and noisy signals from auto-generated ones. In this work, we shift the focus from test quantity to feedback quality. We introduce the Property-Generated Solver (PGS), a novel paradigm designed to generate highly effective feedback via two principles: it must be property-oriented, to provide semantic guidance beyond simple I/O mismatches, and structurally minimal, to reduce cognitive load and isolate root causes. PGS operates by checking high-level program properties (e.g., a sorting function must produce a non-decreasing sequence) then providing the simplest failing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
