Faster Neural Network Training with Data Echoing

Dami Choi; Alexandre Passos; Christopher J. Shallue; George E. Dahl

arXiv:1907.05550·cs.LG·May 11, 2020·33 cites

Faster Neural Network Training with Data Echoing

Dami Choi, Alexandre Passos, Christopher J. Shallue, George E. Dahl

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces data echoing, a technique that reuses intermediate outputs in the data pipeline to reduce upstream computation and accelerate neural network training, especially as hardware accelerators improve.

Contribution

We propose data echoing, a novel method that reuses data pipeline outputs to speed up training, addressing bottlenecks in data preprocessing stages.

Findings

01

At least one data echoing algorithm matches baseline performance with less upstream computation.

02

Achieved a 3.25x reduction in training time for ResNet-50 on ImageNet.

03

Data echoing is effective across various workloads and batch sizes.

Abstract

In the twilight of Moore's law, GPUs and other specialized hardware accelerators have dramatically sped up neural network training. However, earlier stages of the training pipeline, such as disk I/O and data preprocessing, do not run on accelerators. As accelerators continue to improve, these earlier stages will increasingly become the bottleneck. In this paper, we introduce "data echoing," which reduces the total computation used by earlier pipeline stages and speeds up training whenever computation upstream from accelerators dominates the training time. Data echoing reuses (or "echoes") intermediate outputs from earlier pipeline stages in order to reclaim idle capacity. We investigate the behavior of different data echoing algorithms on various workloads, for various amounts of echoing, and for various batch sizes. We find that in all settings, at least one data echoing algorithm can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EdwardTyantov/siim-isic-melanoma-2020
pytorch

Videos

Faster Neural Network Training with Data Echoing (Paper Explained)· youtube

Taxonomy

TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning and ELM