# Easy over Hard: A Case Study on Deep Learning

**Authors:** Wei Fu, Tim Menzies

arXiv: 1703.00133 · 2017-06-27

## TL;DR

This paper compares deep learning with simpler optimization methods, demonstrating that in some cases, traditional techniques can achieve similar results much faster, urging caution in adopting complex models without baseline comparisons.

## Contribution

It shows that simple optimization methods like DE can outperform deep learning in training time while maintaining comparable accuracy, challenging the assumption that more complex models are always better.

## Key findings

- DE achieved similar results to deep learning in 10 minutes
- Deep learning took 14 hours for the same task
- Simple optimizers can be effective alternatives to deep learning

## Abstract

While deep learning is an exciting new technique, the benefits of this method need to be assessed with respect to its computational cost. This is particularly important for deep learning since these learners need hours (to weeks) to train the model. Such long training time limits the ability of (a)~a researcher to test the stability of their conclusion via repeated runs with different random seeds; and (b)~other researchers to repeat, improve, or even refute that original work.   For example, recently, deep learning was used to find which questions in the Stack Overflow programmer discussion forum can be linked together. That deep learning system took 14 hours to execute. We show here that applying a very simple optimizer called DE to fine tune SVM, it can achieve similar (and sometimes better) results. The DE approach terminated in 10 minutes; i.e. 84 times faster hours than deep learning method.   We offer these results as a cautionary tale to the software analytics community and suggest that not every new innovation should be applied without critical analysis. If researchers deploy some new and expensive process, that work should be baselined against some simpler and faster alternatives.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1703.00133/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1703.00133/full.md

## References

81 references — full list in the complete paper: https://tomesphere.com/paper/1703.00133/full.md

---
Source: https://tomesphere.com/paper/1703.00133