The Impact of IR-based Classifier Configuration on the Performance and the Effort of Method-Level Bug Localization

Chakkrit Tantithamthavorn; Surafel Lemma Abebe; Ahmed E. Hassan; Akinori Ihara; Kenichi Matsumoto

arXiv:1806.07727·cs.SE·June 21, 2018

TL;DR

This study evaluates how different IR-based classifier configurations affect bug localization effectiveness and effort, revealing significant impacts and identifying optimal configurations for method-level bug localization.

Contribution

The study systematically analyzes 3,172 classifier configurations, showing how much configuration choices matter and identifying the settings that perform best for method-level bug localization.

Findings

01

Depending on the classifier configuration, top-k performance ranges from 0.44% to 36%.

02

VSM achieves the best performance and the lowest effort.

03

Entity representation configurations have the most impact.
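
To make the VSM finding concrete, here is a minimal sketch of how a vector space model can rank methods against a bug report using tf-idf weights and cosine similarity. It is not the authors' implementation: the function names, the smoothed idf formula, the toy method names, and the pre-tokenized inputs are all assumptions chosen for illustration, and the paper's actual preprocessing and weighting options are among the configuration parameters it varies.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (tf-idf vectors, idf table) for a list of token lists."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    # Assumed weighting: raw term frequency times log(n / df) + 1.
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}
    vectors = [{t: c * idf[t] for t, c in Counter(doc).items()} for doc in docs]
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_methods(bug_report_tokens, methods):
    """Rank method names by similarity to the bug report, best first.

    `methods` maps a (hypothetical) method name to its token list.
    Ties are broken alphabetically so the output is deterministic.
    """
    names = list(methods)
    vectors, idf = tfidf_vectors([methods[name] for name in names])
    # Terms unseen in the corpus get weight 0 in the query vector.
    query = {t: c * idf.get(t, 0.0) for t, c in Counter(bug_report_tokens).items()}
    scored = [(cosine(query, vec), name) for name, vec in zip(names, vectors)]
    return [name for _, name in sorted(scored, key=lambda p: (-p[0], p[1]))]


# Toy corpus: three hypothetical methods and one bug report.
methods = {
    "Parser.parse": ["parse", "token", "error"],
    "Cache.evict": ["cache", "evict", "size"],
    "Ui.render": ["render", "widget", "paint"],
}
ranking = rank_methods(["cache", "evict", "crash"], methods)
```

In this toy corpus only `Cache.evict` shares terms with the bug report, so it is ranked first; the other two methods score zero and fall back to alphabetical order.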

Abstract

Context: IR-based bug localization is a classifier that assists developers in locating buggy source code entities (e.g., files and methods) based on the content of a bug report. Such IR-based classifiers have various parameters that can be configured differently (e.g., the choice of entity representation). Objective: In this paper, we investigate the impact of the choice of IR-based classifier configuration on the top-k performance and on the effort required to examine source code entities before locating a bug at the method level. Method: We execute a large space of classifier configurations, 3,172 in total, on 5,266 bug reports of two software systems, i.e., Eclipse and Mozilla. Results: We find that (1) the choice of classifier configuration impacts the top-k performance from 0.44% to 36% and the required effort from 4,395 to 50,000 LOC; (2) classifier configurations with similar…
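
The two measures named in the abstract can be sketched as follows. This is an illustrative reading, not the authors' code: it assumes top-k performance is the percentage of bug reports with at least one buggy method among the top k ranked methods, and that effort is the cumulative LOC a developer reads down the ranked list up to and including the first buggy method. The function names and the toy data are hypothetical.

```python
def top_k_performance(rankings, buggy, k):
    """Percentage of bug reports with at least one buggy method in the top k.

    rankings: bug id -> list of method names, best match first.
    buggy:    bug id -> set of methods actually changed to fix the bug.
    """
    hits = sum(
        1
        for bug_id, ranked in rankings.items()
        if any(method in buggy[bug_id] for method in ranked[:k])
    )
    return 100.0 * hits / len(rankings)


def effort_loc(ranked, buggy_methods, loc):
    """Cumulative LOC examined until the first buggy method is reached.

    Assumption: the buggy method's own LOC is counted.
    Returns None if no buggy method appears in the ranking.
    """
    total = 0
    for method in ranked:
        total += loc[method]
        if method in buggy_methods:
            return total
    return None


# Toy data: two bug reports ranked over three hypothetical methods.
rankings = {"b1": ["a", "b", "c"], "b2": ["c", "a", "b"]}
buggy = {"b1": {"b"}, "b2": {"b"}}
loc = {"a": 10, "b": 20, "c": 30}

top2 = top_k_performance(rankings, buggy, k=2)
effort_b1 = effort_loc(rankings["b1"], buggy["b1"], loc)
effort_b2 = effort_loc(rankings["b2"], buggy["b2"], loc)
```

Here `b1` has its buggy method at rank 2 and `b2` at rank 3, so only one of the two reports counts as a hit at k=2, and the effort for `b2` is larger because every method ahead of the buggy one must be read first.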

Code & Models

Repositories

SAILResearch/replication-ist_bug_localization
Official