CREF: An LLM-based Conversational Software Repair Framework for Programming Tutors
Boyang Yang, Haoye Tian, Weiguo Pian, Haoran Yu, Haitao Wang, Jacques, Klein, Tegawend\'e F. Bissyand\'e, Shunfu Jin

TL;DR
This paper introduces CREF, a conversational framework leveraging LLMs and tutor interactions to improve program repair accuracy in educational settings, demonstrating significant performance gains and practical benefits.
Contribution
It presents a new non-crawled benchmark, analyzes the impact of additional information on LLM repair performance, and develops CREF, a novel conversational repair system for programming tutors.
Findings
Tutor guidance significantly improves LLM repair performance.
CREF achieves up to 24.6% improvement in repair accuracy.
Using GPT-4, CREF reaches an AVG-5 of 76.6%.
Abstract
Program repair techniques offer cost-saving benefits for debugging within software development and programming education scenarios. With the proven effectiveness of Large Language Models (LLMs) in code-related tasks, researchers have explored their potential for program repair. However, it is crucial to recognize that existing repair benchmarks may have influenced LLM training data, potentially causing data leakage. To evaluate LLMs' realistic repair capabilities, (1) we introduce an extensive, non-crawled benchmark, referred to as TutorCode, comprising 1,239 C++ defect codes and associated information such as tutor guidance, solution description, failing test cases, and the corrected code. Our work assesses the repair performance of 12 LLMs on TutorCode, measuring repair correctness (TOP-5 and AVG-5) and patch precision (RPSR). (2) We then provide a comprehensive investigation into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics · Text Readability and Simplification
