IRJIT: A Simple, Online, Information Retrieval Approach for Just-In-Time Software Defect Prediction
Hareem Sahar, Abdul Ali Bangash, Abram Hindle, Denilson Barbosa

TL;DR
IRJIT is an online, explainable, information retrieval-based method for just-in-time software defect prediction that is faster and maintains competitive accuracy compared to traditional machine learning models.
Contribution
The paper introduces IRJIT, a novel online and explainable approach for defect prediction using information retrieval, addressing the limitations of existing complex ML-based methods.
Findings
IRJIT is up to 112 times faster than state-of-the-art approaches.
IRJIT provides explainability at commit and line levels.
IRJIT achieves comparable prediction performance to existing methods.
Abstract
Just-in-Time software defect prediction (JIT-SDP) prevents the introduction of defects into the software by identifying them at commit check-in time. Current software defect prediction approaches rely on manually crafted features such as change metrics and involve expensive to train machine learning or deep learning models. These models typically involve extensive training processes that may require significant computational resources and time. These characteristics can pose challenges when attempting to update the models in real-time as new examples become available, potentially impacting their suitability for fast online defect prediction. Furthermore, the reliance on a complex underlying model makes these approaches often less explainable, which means the developers cannot understand the reasons behind models' predictions. An approach that is not explainable might not be adopted in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software System Performance and Reliability · Software Reliability and Analysis Research
