Hyperparameter Optimization for Effort Estimation
Tianpei Xia, Rahul Krishna, Jianfeng Chen, George Mathew, Xipeng Shen, and Tim Menzies

TL;DR
This paper introduces a new hyperparameter optimization architecture called OIL for improving software effort estimation accuracy, demonstrating significant improvements with efficient tuning methods on a large dataset.
Contribution
The paper presents OIL, a novel hyperparameter optimization framework for effort estimation, and identifies effective tuning strategies that outperform existing methods in accuracy and efficiency.
Findings
Large improvements in effort estimation accuracy after tuning.
Regression trees (CART) with evolutionary tuning are highly effective.
Tuning can reduce optimization time from weeks to hours.
Abstract
Software analytics has been widely used in software engineering for many tasks such as generating effort estimates for software projects. One of the "black arts" of software analytics is tuning the parameters controlling a data mining algorithm. Such hyperparameter optimization has been widely studied in other software analytics domains (e.g. defect prediction and text mining) but, so far, has not been extensively explored for effort estimation. Accordingly, this paper seeks simple, automatic, effective and fast methods for finding good tunings for automatic software effort estimation. We introduce a hyperparameter optimization architecture called OIL (Optimized Inductive Learning). We test OIL on a wide range of hyperparameter optimizers using data from 945 software projects. After tuning, large improvements in effort estimation accuracy were observed (measured in terms of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Software Testing and Debugging Techniques
