Dataloader Parameter Tuner: An Automated Dataloader Parameter Tuner for   Deep Learning Models

JooYoung Park; DoangJoo Synn; XinYu Piao; Jong-Kook Kim

arXiv:2210.05244·cs.DC·October 12, 2022·1 cites

Dataloader Parameter Tuner: An Automated Dataloader Parameter Tuner for Deep Learning Models

JooYoung Park, DoangJoo Synn, XinYu Piao, Jong-Kook Kim

PDF

Open Access

TL;DR

This paper introduces Dataloader Parameter Tuner (DPT), an automated framework that optimizes dataloader parameters like subprocesses and prefetch factor to enhance data loading efficiency in deep learning models.

Contribution

The paper presents a novel automated framework that uses grid search to find optimal dataloader parameters, improving data transfer speed in deep learning systems.

Findings

01

Optimizes dataloader subprocesses and prefetch factor.

02

Accelerates data transfer in deep learning workflows.

03

Automates parameter tuning process.

Abstract

Deep learning has recently become one of the most compute/data-intensive methods and is widely used in many research areas and businesses. One of the critical challenges of deep learning is that it has many parameters that can be adjusted, and the optimal value may need to be determined for faster operation and high accuracy. The focus of this paper is the adjustable parameters of the dataloader. The dataloader in a system mainly groups the data appropriately and loads it to the main memory for the deep learning model to use. We introduce an automated framework called Dataloader Parameter Tuner (DPT) that determines the optimal value for the parameters required for the dataloader. This framework discovers the optimal values for the number of dataloader's subprocesses (i.e., worker) and prefetch factor through grid search to accelerate the data transfer for machine learning systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Parallel Computing and Optimization Techniques · Advanced Neural Network Applications