Outcome-Conditioned Reasoning Distillation for Resolving Software Issues
Chenglin Li, Yisen Xu, Zehao Wang, Shin Hwei Tan, Tse-Hsun (Peter) Chen

TL;DR
This paper introduces O-CRD, a framework that leverages verified past repairs to guide software issue fixing, significantly improving efficiency and success rates without additional fine-tuning or search.
Contribution
The paper proposes a novel outcome-conditioned distillation method that reuses verified repairs to steer future software issue resolution, reducing reliance on costly search procedures.
Findings
Increases Pass@1 by over 8% across multiple models.
Effectively reuses past repairs to improve current fix accuracy.
Reduces inference cost by avoiding repeated search.
Abstract
Software issue resolution in large repositories is a long-range decision process: choices made during localization shape the space of viable edits, and missteps can compound into incorrect patches. Despite this, many LLM-based repair pipelines still operate in a reset-and-solve manner, producing fresh reasoning for every new issue instead of carrying forward what worked in past fixes. This is wasteful because repositories routinely contain earlier issues with overlapping structure, failure modes, or constraints, where prior repair experience could provide useful guidance. Existing approaches typically harvest this signal through forward-time trial procedures, such as repeated refinement or search, incurring high inference cost while still risking divergence from the eventual correct patch. We present an Outcome-Conditioned Reasoning Distillation(O-CRD) framework that uses resolved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Software System Performance and Reliability
