A Method to Assess and Argue for Practical Significance in Software Engineering
Richard Torkar, Carlo A. Furia, Robert Feldt, Francisco Gomes de, Oliveira Neto, Lucas Gren, Per Lenberg, Neil A. Ernst

TL;DR
This paper advocates using Bayesian data analysis combined with cumulative prospect theory to rigorously assess and argue for practical significance in empirical software engineering studies, addressing a gap in standard techniques.
Contribution
It introduces a systematic Bayesian approach to evaluate practical significance, demonstrated through a case study reanalyzing existing data with advanced statistical modeling.
Findings
Bayesian analysis propagates data uncertainty transparently.
The combined approach clarifies practical relevance of effects.
Supports better decision-making for practitioners.
Abstract
A key goal of empirical research in software engineering is to assess practical significance, which answers whether the observed effects of some compared treatments show a relevant difference in practice in realistic scenarios. Even though plenty of standard techniques exist to assess statistical significance, connecting it to practical significance is not straightforward or routinely done; indeed, only a few empirical studies in software engineering assess practical significance in a principled and systematic way. In this paper, we argue that Bayesian data analysis provides suitable tools to assess practical significance rigorously. We demonstrate our claims in a case study comparing different test techniques. The case study's data was previously analyzed (Afzal et al., 2015) using standard techniques focusing on statistical significance. Here, we build a multilevel model of the same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
