Measuring the Algorithmic Efficiency of Neural Networks
Danny Hernandez, Tom B. Brown

TL;DR
This paper quantifies algorithmic progress in AI by measuring reductions in compute required for training neural networks, revealing a 44x decrease from 2012 to 2019 and emphasizing the importance of combining hardware and algorithmic efficiency.
Contribution
It introduces a method to measure algorithmic efficiency in neural networks and demonstrates its significance in understanding AI progress over time.
Findings
Compute needed for AlexNet-level on ImageNet decreased 44x from 2012 to 2019.
Algorithmic efficiency doubled approximately every 16 months.
Hardware and algorithmic improvements multiply, influencing overall AI progress.
Abstract
Three factors drive the advance of AI: algorithmic innovation, data, and the amount of compute available for training. Algorithmic progress has traditionally been more difficult to quantify than compute and data. In this work, we argue that algorithmic progress has an aspect that is both straightforward to measure and interesting: reductions over time in the compute needed to reach past capabilities. We show that the number of floating-point operations required to train a classifier to AlexNet-level performance on ImageNet has decreased by a factor of 44x between 2012 and 2019. This corresponds to algorithmic efficiency doubling every 16 months over a period of 7 years. By contrast, Moore's Law would only have yielded an 11x cost improvement. We observe that hardware and algorithmic efficiency gains multiply and can be on a similar scale over meaningful horizons, which suggests that a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification
