# Better Software Analytics via "DUO": Data Mining Algorithms Using/Used-by Optimizers

**Authors:** Amritanshu Agrawal, Tim Menzies, Leandro L. Minku, Markus Wagner, Zhe Yu

arXiv: 1812.01550 · 2020-08-31

## TL;DR

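This paper argues that a new field of empirical software engineering is emerging: DUO, or data mining using/used-by optimizers. Optimizers tune data miners, data miners generate models for optimizers to explore, and the combination can yield better predictive models than an unoptimized data miner alone.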

## Contribution

It identifies and names DUO, an emerging area of research and practice in which data miners and optimizers are used together, and argues that combining them is possible, useful, and necessary for software analytics.

## Key findings

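- Optimizers can advise how best to adjust the control parameters of a data miner, and data miners can generate models that optimizers then explore.
- DUO can help build "better" predictive models, where "better" means either greater predictive accuracy or faster modeling time.
- Results from an unoptimized data miner can be quickly refuted by applying an optimizer to produce a different, better-performing model.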

## Abstract

This paper claims that a new field of empirical software engineering research and practice is emerging: data mining using/used-by optimizers for empirical studies or DUO. For example, data miners can generate models that are explored by optimizers. Also, optimizers can advise how to best adjust the control parameters of a data miner. This combined approach acts like an agent leaning over the shoulder of an analyst that advises "ask this question next" or "ignore that problem, it is not relevant to your goals". Further, those agents can help us build "better" predictive models, where "better" can be either greater predictive accuracy or faster modeling time (which, in turn, enables the exploration of a wider range of options). We also caution that the era of papers that just use data miners is coming to an end. Results obtained from an unoptimized data miner can be quickly refuted, just by applying an optimizer to produce a different (and better performing) model. Our conclusion, hence, is that for software analytics it is possible, useful and necessary to combine data mining and optimization using DUO.
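As a rough illustration of the "optimizers can advise how to best adjust the control parameters of a data miner" direction, the sketch below tunes a single parameter of a toy learner with random search. Everything in it is an assumption made for illustration and none of it comes from the paper: the synthetic data, the k-nearest-neighbour learner, the random-search optimizer, and all function names are hypothetical stand-ins for whatever data miner and optimizer a study actually uses.

```python
import random


def make_data(n, rng):
    """Hypothetical synthetic data: label is 1 when x + y > 1, with 20% label noise."""
    rows = []
    for _ in range(n):
        x, y = rng.random(), rng.random()
        label = int(x + y > 1.0)
        if rng.random() < 0.2:
            label = 1 - label  # flip the label to simulate noise
        rows.append(((x, y), label))
    return rows


def knn_predict(train, point, k):
    """The 'data miner': majority vote among the k nearest training rows."""
    def dist2(row):
        (x, y), _ = row
        return (x - point[0]) ** 2 + (y - point[1]) ** 2

    nearest = sorted(train, key=dist2)[:k]
    votes = sum(label for _, label in nearest)
    return int(2 * votes > k)


def accuracy(train, valid, k):
    """Fraction of validation rows classified correctly for a given k."""
    hits = sum(knn_predict(train, point, k) == label for point, label in valid)
    return hits / len(valid)


def random_search(train, valid, candidates, budget, start_k, rng):
    """The 'optimizer': keep the best k seen, starting from the default setting."""
    best_k, best_acc = start_k, accuracy(train, valid, start_k)
    for k in rng.sample(candidates, budget):
        acc = accuracy(train, valid, k)
        if acc > best_acc:
            best_k, best_acc = k, acc
    return best_k, best_acc


rng = random.Random(1)
train, valid = make_data(200, rng), make_data(100, rng)

default_k = 1  # an "off-the-shelf" setting, chosen here only for illustration
default_acc = accuracy(train, valid, default_k)

candidates = list(range(1, 40, 2))  # odd k values avoid tied votes
tuned_k, tuned_acc = random_search(train, valid, candidates, 10, default_k, rng)

print(f"default k={default_k}: accuracy={default_acc:.2f}")
print(f"tuned   k={tuned_k}: accuracy={tuned_acc:.2f}")
```

Because the search starts from the default setting and only accepts improvements, the tuned accuracy on the validation rows can never be lower than the default. A real study would also need a separate held-out test set before claiming that the tuned model is better.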

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1812.01550/full.md

## Figures

23 figures with captions in the complete paper: https://tomesphere.com/paper/1812.01550/full.md

## References

136 references — full list in the complete paper: https://tomesphere.com/paper/1812.01550/full.md

---
Source: https://tomesphere.com/paper/1812.01550