Runtime Concurrency Control and Operation Scheduling for High Performance Neural Network Training
Jiawen Liu, Dong Li, Gokcen Kestor, Jeffrey Vetter

TL;DR
This paper presents an extension to the TensorFlow runtime that automatically manages concurrency and schedules neural network operations, significantly improving training performance by up to 49%.
Contribution
It introduces a lightweight, accurate performance model and scheduling strategies for automatic concurrency control and operation scheduling in neural network training.
Findings
Achieves 33% average performance improvement over default configurations.
Demonstrates up to 49% performance gain on neural network models.
Provides a runtime system that closely approaches manually optimized performance.
Abstract
Training neural network often uses a machine learning framework such as TensorFlow and Caffe2. These frameworks employ a dataflow model where the NN training is modeled as a directed graph composed of a set of nodes. Operations in neural network training are typically implemented by the frameworks as primitives and represented as nodes in the dataflow graph. Training NN models in a dataflow-based machine learning framework involves a large number of fine-grained operations. Those operations have diverse memory access patterns and computation intensity. How to manage and schedule those operations is challenging, because we have to decide the number of threads to run each operation (concurrency control) and schedule those operations for good hardware utilization and system throughput. In this paper, we extend an existing runtime system (the TensorFlow runtime) to enable automatic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Parallel Computing and Optimization Techniques · Ferroelectric and Negative Capacitance Devices
