Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning   Preprocessing Pipelines

Alexander Isenko; Ruben Mayer; Jeffrey Jedele; Hans-Arno Jacobsen

arXiv:2202.08679·cs.LG·March 28, 2022

Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning Preprocessing Pipelines

Alexander Isenko, Ruben Mayer, Jeffrey Jedele, Hans-Arno Jacobsen

PDF

1 Repo

TL;DR

This paper analyzes data preprocessing bottlenecks in deep learning pipelines, introduces a profiling library for optimization, and demonstrates up to 13x throughput improvements through tuning.

Contribution

It provides a new perspective on dataset preparation trade-offs and offers an open-source tool for automatic preprocessing strategy selection.

Findings

01

Up to 13x throughput increase with tuning

02

Trade-offs between throughput, preprocessing time, and storage

03

Open-source profiling library for pipeline optimization

Abstract

Preprocessing pipelines in deep learning aim to provide sufficient data throughput to keep the training processes busy. Maximizing resource utilization is becoming more challenging as the throughput of training processes increases with hardware innovations (e.g., faster GPUs, TPUs, and inter-connects) and advanced parallelization techniques that yield better scalability. At the same time, the amount of training data needed in order to train increasingly complex models is growing. As a consequence of this development, data preprocessing and provisioning are becoming a severe bottleneck in end-to-end deep learning pipelines. In this paper, we provide an in-depth analysis of data preprocessing pipelines from four different machine learning domains. We introduce a new perspective on efficiently preparing datasets for end-to-end deep learning pipelines and extract individual trade-offs to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cirquit/presto
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.