A Study on the Impact of Fault localization Granularity for Repository-Scale Code Repair Tasks
Joseph Townsend, Chandresh Pravin, Kwun Ho Ngan, Matthieu Parizy

TL;DR
This paper investigates how the granularity of fault localization affects the success of automatic code repair at the repository scale, using a modified framework to test different localization levels.
Contribution
It introduces a framework for isolating and testing the impact of fault localization granularity on repair success in repository-scale scenarios.
Findings
Function-level localization yields higher repair rates than line- or file-level.
The optimal granularity may depend on the specific repair task.
The study provides a proof of concept for analyzing fault localization's impact on repair.
Abstract
Automatic program repair can be a challenging task, especially when resolving complex issues at a repository-level, which often involves issue reproduction, fault localization, code repair, testing and validation. Issues of this scale can be commonly found in popular GitHub repositories or datasets that are derived from them. Some repository-level approaches separate localization and repair into distinct phases. Where this is the case, the fault localization approaches vary in terms of the granularity of localization. Where the impact of granularity is explored to some degree for smaller datasets, not all isolate this issue from the separate question of localization accuracy by testing code repair under the assumption of perfect fault localization. To the best of the authors' knowledge, no repository-scale studies have explicitly investigated granularity under this assumption, nor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
