SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution
Chengxing Xie, Bowen Li, Chang Gao, He Du, Wai Lam, Difan Zou, Kai, Chen

TL;DR
SWE-Fixer is an open-source framework that enhances GitHub issue resolution by combining efficient retrieval and code editing modules, trained on a large dataset, achieving state-of-the-art results with minimal model calls.
Contribution
The paper introduces SWE-Fixer, a novel open-source framework with a large dataset for effective and efficient GitHub issue resolution, improving reproducibility and transparency.
Findings
Achieves 24.7% on Lite and 32.8% on Verified benchmarks.
Requires only two model calls per instance, enhancing efficiency.
Outperforms existing open-source models in code fixing tasks.
Abstract
Large Language Models (LLMs) have demonstrated remarkable proficiency across a variety of complex tasks. One significant application of LLMs is in tackling software engineering challenges, particularly in resolving real-world tasks on GitHub by fixing code based on the issues reported by the users. However, many current approaches rely on proprietary LLMs, which limits reproducibility, accessibility, and transparency. The critical components of LLMs for addressing software engineering issues and how their capabilities can be effectively enhanced remain unclear. To address these challenges, we introduce SWE-Fixer, a novel open-source framework designed to effectively and efficiently resolve GitHub issues. SWE-Fixer comprises two essential modules: a code file retrieval module and a code editing module. The retrieval module employs BM25 along with a lightweight model to achieve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Web Data Mining and Analysis · Data Mining Algorithms and Applications
