Does class size matter? An in-depth assessment of the effect of class   size in software defect prediction

Amjed Tahir; Kwabena E. Bennin; Xun Xiao; Stephen G.; MacDonell

arXiv:2106.04687·cs.SE·June 10, 2021

Does class size matter? An in-depth assessment of the effect of class size in software defect prediction

Amjed Tahir, Kwabena E. Bennin, Xun Xiao, Stephen G., MacDonell

PDF

TL;DR

This study investigates how class size influences the relationship between object-oriented metrics and software defects, revealing that size's impact varies across metrics and should be carefully considered in defect prediction models.

Contribution

It provides a detailed analysis of class size effects using mediation and moderation analysis, offering new insights and statistical procedures for software defect prediction research.

Findings

01

Size significantly mediates the relationship between CBO and defects.

02

Size potentially moderates the relationship between Fan-out and defects.

03

The impact of size varies across different OO metrics.

Abstract

In the past 20 years, defect prediction studies have generally acknowledged the effect of class size on software prediction performance. To quantify the relationship between object-oriented (OO) metrics and defects, modelling has to take into account the direct, and potentially indirect, effects of class size on defects. However, some studies have shown that size cannot be simply controlled or ignored, when building prediction models. As such, there remains a question whether, and when, to control for class size. This study provides a new in-depth examination of the impact of class size on the relationship between OO metrics and software defects or defect-proneness. We assess the impact of class size on the number of defects and defect-proneness in software systems by employing a regression-based mediation (with bootstrapping) and moderation analysis to investigate the direct and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.