Benchmarking Neural Network Training Algorithms

George E. Dahl; Frank Schneider; Zachary Nado; Naman Agarwal; Chandramouli Shama Sastry; Philipp Hennig; Sourabh Medapati; Runa Eschenhagen; Priya Kasimbeg; Daniel Suo; Juhan Bae; Justin Gilmer; Abel L. Peirson; Bilal Khan; Rohan Anil; Mike Rabbat; Shankar Krishnan; Daniel Snider; Ehsan Amid; Kongtao Chen; Chris J. Maddison; Rakshith Vasudev; Michal Badura; Ankush Garg; Peter Mattson

arXiv:2306.07179·cs.LG·June 19, 2025·6 cites

Benchmarking Neural Network Training Algorithms

George E. Dahl, Frank Schneider, Zachary Nado, Naman Agarwal, Chandramouli Shama Sastry, Philipp Hennig, Sourabh Medapati, Runa Eschenhagen, Priya Kasimbeg, Daniel Suo, Juhan Bae, Justin Gilmer, Abel L. Peirson, Bilal Khan, Rohan Anil, Mike Rabbat, Shankar Krishnan

PDF

Open Access 4 Repos

TL;DR

This paper introduces a new benchmarking framework called AlgoPerf for evaluating neural network training algorithms, addressing key challenges in measuring training speed and robustness across workloads.

Contribution

The paper presents a novel, standardized benchmark for training algorithms that accounts for workload variability and hyperparameter tuning, enabling fairer and more reliable comparisons.

Findings

01

Baseline optimizers show measurable gaps in performance.

02

The benchmark demonstrates the feasibility of comparing training algorithms.

03

Robustness to workload changes varies among methods.

Abstract

Training algorithms, broadly construed, are an essential part of every deep learning pipeline. Training algorithm improvements that speed up training across a wide variety of workloads (e.g., better update rules, tuning protocols, learning rate schedules, or data selection schemes) could save time, save computational resources, and lead to better, more accurate, models. Unfortunately, as a community, we are currently unable to reliably identify training algorithm improvements, or even determine the state-of-the-art training algorithm. In this work, using concrete experiments, we argue that real progress in speeding up training requires new benchmarks that resolve three basic challenges faced by empirical comparisons of training algorithms: (1) how to decide when training is complete and precisely measure training time, (2) how to handle the sensitivity of measurements to exact workload…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings