Reasoning Distillation for Lightweight Automated Program Repair
Aanand Balasubramanian, Sashank Silwal

TL;DR
This paper introduces a reasoning distillation method that uses symbolic reasoning supervision from a large teacher model to enhance the performance and interpretability of small, resource-efficient automated program repair models.
Contribution
It proposes a novel reasoning distillation approach that improves fix-type classification in lightweight models by incorporating high-level causal reasoning signals from a large teacher model.
Findings
Reasoning supervision improves macro performance, especially on rare bug categories.
Correct reasoning traces are strongly correlated with correct fix predictions.
The approach enhances interpretability and robustness without increasing model complexity.
Abstract
We study whether lightweight symbolic reasoning supervision can improve fix type classification in compact automated program repair models. Small code models are attractive for resource-constrained settings, but they typically produce only a single prediction, making it unclear whether they learn meaningful program structure or rely on shallow correlations. We propose a reasoning distillation approach in which a large teacher model provides structured symbolic reasoning tags alongside fix-type labels. These tags capture high-level causal properties of bugs without relying on free-form explanations. We train a CodeT5-based student model under label-only and reasoning-distilled settings on the IntroClass benchmark. Reasoning supervision consistently improves macro averaged performance, particularly on less frequent bug categories, without increasing model size or complexity. We further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Software System Performance and Reliability
