MLPerf Training Benchmark

Peter Mattson; Christine Cheng; Cody Coleman; Greg Diamos; Paulius Micikevicius; David Patterson; Hanlin Tang; Gu-Yeon Wei; Peter Bailis; Victor Bittorf; David Brooks; Dehao Chen; Debojyoti Dutta; Udit Gupta; Kim Hazelwood; Andrew Hock; Xinyuan Huang; Atsushi Ike; Bill Jia; Daniel Kang; David Kanter; Naveen Kumar; Jeffery Liao; Guokai Ma; Deepak Narayanan; Tayo Oguntebi; Gennady Pekhimenko; Lillian Pentecost; Vijay Janapa Reddi; Taylor Robie; Tom St. John; Tsuguchika Tabaru; Carole-Jean Wu; Lingjie Xu; Masafumi Yamazaki; Cliff Young; Matei Zaharia

arXiv:1910.01500·cs.LG·March 3, 2020·172 cites

TL;DR

MLPerf is a benchmark suite for evaluating and comparing machine learning training performance across diverse hardware and software systems. It is designed around challenges specific to training, such as stochastic run-to-run variance and system diversity.

Contribution

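This paper introduces MLPerf, a standardized benchmark for ML training that addresses three challenges: throughput optimizations that can lengthen time to solution, high run-to-run variance caused by stochastic training, and hardware and software diversity that complicates fair comparison.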

Findings

01. MLPerf effectively drives performance improvements across vendors.

02. Benchmark results show high variability in training times, highlighting the need for standardized metrics.

03. MLPerf facilitates fair comparison of ML training solutions.

Abstract

Machine learning (ML) needs industry-standard performance benchmarks to support design and competitive evaluation of the many emerging software and hardware solutions for ML. But ML training presents three unique benchmarking challenges absent from other domains: optimizations that improve training throughput can increase the time to solution, training is stochastic and time to solution exhibits high variance, and software and hardware systems are so diverse that fair benchmarking with the same binary, code, and even hyperparameters is difficult. We therefore present MLPerf, an ML benchmark that overcomes these challenges. Our analysis quantitatively evaluates MLPerf's efficacy at driving performance and scalability improvements across two rounds of results from multiple vendors.
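The first two challenges in the abstract can be illustrated with a small sketch. It is not code from the paper: the function names and all numbers are hypothetical, and aggregating runs by dropping the fastest and slowest before averaging is an assumption made here for illustration. The sketch shows how a configuration with higher throughput can still take longer to reach a quality target, and how several noisy runs might be reduced to one reported time.

```python
from statistics import mean


def time_to_solution(samples_per_epoch, epochs_to_target, throughput):
    """Seconds to reach the quality target.

    Total samples processed divided by samples per second.
    All arguments are hypothetical inputs, not measured values.
    """
    return samples_per_epoch * epochs_to_target / throughput


def trimmed_mean(run_times):
    """Drop the single fastest and slowest run, then average the rest.

    This aggregation rule is an assumption of this sketch.
    """
    if len(run_times) < 3:
        raise ValueError("need at least three runs to trim both ends")
    ordered = sorted(run_times)
    return mean(ordered[1:-1])


# Hypothetical baseline: 2,000 samples/s, reaches the target in 10 epochs.
baseline = time_to_solution(1_000_000, 10, 2_000)

# Hypothetical tuned setup: 50% more throughput, but needs 16 epochs.
tuned = time_to_solution(1_000_000, 16, 3_000)

print(f"baseline: {baseline:.0f} s, tuned: {tuned:.0f} s")

# Five hypothetical runs of the same configuration with different seeds.
runs = [100, 90, 130, 95, 105]
print(f"reported time: {trimmed_mean(runs)} s")
```

In this made-up case the tuned setup processes samples faster yet finishes later, which is why a throughput-only metric would rank the two systems in the wrong order.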

Taxonomy

Topics: Machine Learning and Data Classification · Advanced Neural Network Applications · Data Stream Mining Techniques