You Cannot Fix What You Cannot Find! An Investigation of Fault   Localization Bias in Benchmarking Automated Program Repair Systems

Kui Liu; Anil Koyuncu; Tegawend\'e F. Bissyand\'e; Dongsun; Kim; Jacques Klein; Yves Le Traon

arXiv:1812.07283·cs.SE·February 18, 2019·1 cites

You Cannot Fix What You Cannot Find! An Investigation of Fault Localization Bias in Benchmarking Automated Program Repair Systems

Kui Liu, Anil Koyuncu, Tegawend\'e F. Bissyand\'e, Dongsun, Kim, Jacques Klein, Yves Le Traon

PDF

Open Access

TL;DR

This paper investigates how fault localization bias affects benchmarking of Automated Program Repair systems, revealing that current practices may mislead performance comparisons and emphasizing the need for transparent evaluation procedures.

Contribution

It identifies the impact of fault localization configurations on APR benchmarking, advocating for standardized, transparent evaluation to ensure fair comparisons.

Findings

01

Only a subset of bugs can be localized by common FL techniques.

02

FL configuration bias can mislead APR performance comparisons.

03

Authors often do not disclose tuning parameters affecting results.

Abstract

Properly benchmarking Automated Program Repair (APR) systems should contribute to the development and adoption of the research outputs by practitioners. To that end, the research community must ensure that it reaches significant milestones by reliably comparing state-of-the-art tools for a better understanding of their strengths and weaknesses. In this work, we identify and investigate a practical bias caused by the fault localization (FL) step in a repair pipeline. We propose to highlight the different fault localization configurations used in the literature, and their impact on APR systems when applied to the Defects4J benchmark. Then, we explore the performance variations that can be achieved by `tweaking' the FL step. Eventually, we expect to create a new momentum for (1) full disclosure of APR experimental procedures with respect to FL, (2) realistic expectations of repairing bugs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Software Reliability and Analysis Research · Software System Performance and Reliability