ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via   Learned Finite State Machines

Siyuan Chen; Pratik Fegade; Tianqi Chen; Phillip B. Gibbons; Todd C.; Mowry

arXiv:2302.03851·cs.LG·February 9, 2023

ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via Learned Finite State Machines

Siyuan Chen, Pratik Fegade, Tianqi Chen, Phillip B. Gibbons, Todd C., Mowry

PDF

Open Access 1 Video

TL;DR

This paper introduces ED-Batch, a reinforcement learning-based method that automatically discovers efficient batching policies for dynamic neural networks, significantly improving execution speed and reducing data movement costs.

Contribution

It proposes a novel finite state machine approach combined with reinforcement learning to optimize batching policies for dynamic DNNs, addressing limitations of heuristic methods.

Findings

01

Speeds up state-of-the-art frameworks by up to 2.45x

02

Reduces data movement overheads through memory-aware batching

03

Effective across CPU and GPU for various DNN structures

Abstract

Batching has a fundamental influence on the efficiency of deep neural network (DNN) execution. However, for dynamic DNNs, efficient batching is particularly challenging as the dataflow graph varies per input instance. As a result, state-of-the-art frameworks use heuristics that result in suboptimal batching decisions. Further, batching puts strict restrictions on memory adjacency and can lead to high data movement costs. In this paper, we provide an approach for batching dynamic DNNs based on finite state machines, which enables the automatic discovery of batching policies specialized for each DNN via reinforcement learning. Moreover, we find that memory planning that is aware of the batching policy can save significant data movement overheads, which is automated by a PQ tree-based algorithm we introduce. Experimental results show that our framework speeds up state-of-the-art frameworks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via Learned Finite State Machines· slideslive

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification

MethodsAttentive Walk-Aggregating Graph Neural Network