JITLine: A Simpler, Better, Faster, Finer-grained Just-In-Time Defect   Prediction

Chanathip Pornprasit; Chakkrit Tantithamthavorn

arXiv:2103.07068·cs.SE·March 17, 2021

JITLine: A Simpler, Better, Faster, Finer-grained Just-In-Time Defect Prediction

Chanathip Pornprasit, Chakkrit Tantithamthavorn

PDF

TL;DR

JITLine is a new, efficient, and fine-grained approach for just-in-time defect prediction that outperforms existing methods in accuracy, speed, and line-level defect localization, aiding developers in prioritizing and identifying defects more effectively.

Contribution

We introduce JITLine, a novel approach that improves upon prior models by providing more accurate, faster, and line-level defect predictions in a just-in-time context.

Findings

01

JITLine achieves 26%-38% higher F-measure than existing methods.

02

JITLine is 70-100 times faster than CC2Vec and DeepJIT.

03

JITLine's line-level predictions are 133%-150% more accurate.

Abstract

A Just-In-Time (JIT) defect prediction model is a classifier to predict if a commit is defect-introducing. Recently, CC2Vec -- a deep learning approach for Just-In-Time defect prediction -- has been proposed. However, CC2Vec requires the whole dataset (i.e., training + testing) for model training, assuming that all unlabelled testing datasets would be available beforehand, which does not follow the key principles of just-in-time defect predictions. Our replication study shows that, after excluding the testing dataset for model training, the F-measure of CC2Vec is decreased by 38.5% for OpenStack and 45.7% for Qt, highlighting the negative impact of excluding the testing dataset for Just-In-Time defect prediction. In addition, CC2Vec cannot perform fine-grained predictions at the line level (i.e., which lines are most risky for a given commit). In this paper, we propose JITLine -- a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.