Tuning for Software Analytics: is it Really Necessary?
Wei Fu, Tim Menzies, Xipeng Shen

TL;DR
This paper demonstrates that simple, automatic tuning methods significantly improve software defect prediction accuracy, challenging the assumption that extensive tuning is necessary for effective software analytics.
Contribution
It introduces an efficient differential evolution-based tuning approach that requires only tens of attempts to optimize defect predictors, showing substantial performance gains.
Findings
Tuning can dramatically improve defect prediction precision from 0% to 60%.
Simple tuning methods are surprisingly effective and quick.
Standard practices should incorporate tuning optimization for better results.
Abstract
Context: Data miners have been widely used in software engineering to, say, generate defect predictors from static code measures. Such static code defect predictors perform well compared to manual methods, and they are easy to use and useful to use. But one of the "black art" of data mining is setting the tunings that control the miner. Objective:We seek simple, automatic, and very effective method for finding those tunings. Method: For each experiment with different data sets (from open source JAVA systems), we ran differential evolution as anoptimizer to explore the tuning space (as a first step) then tested the tunings using hold-out data. Results: Contrary to our prior expectations, we found these tunings were remarkably simple: it only required tens, not thousands,of attempts to obtain very good results. For example, when learning software defect predictors, this method can quickly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
