Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads
Siyu Wang, Yi Rong, Shiqing Fan, Zhen Zheng, LanSong Diao, Guoping, Long, Jun Yang, Xiaoyong Liu, Wei Lin

TL;DR
Auto-MAP is a reinforcement learning framework that automatically discovers efficient distributed execution plans for DNN workloads, reducing manual effort and improving throughput across various models.
Contribution
It introduces Auto-MAP, a novel DQN-based framework that automates the exploration of distributed execution strategies at the IR level for DNNs.
Findings
Auto-MAP finds optimal strategies within two hours.
Auto-MAP achieves higher throughput on NLP and convolution models.
Efficient exploration is enabled by task-specific pruning in DQN.
Abstract
The last decade has witnessed growth in the computational requirements for training deep neural networks. Current approaches (e.g., data/model parallelism, pipeline parallelism) parallelize training tasks onto multiple devices. However, these approaches always rely on specific deep learning frameworks and requires elaborate manual design, which make it difficult to maintain and share between different type of models. In this paper, we propose Auto-MAP, a framework for exploring distributed execution plans for DNN workloads, which can automatically discovering fast parallelization strategies through reinforcement learning on IR level of deep learning models. Efficient exploration remains a major challenge for reinforcement learning. We leverage DQN with task-specific pruning strategies to help efficiently explore the search space including optimized strategies. Our evaluation shows that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Neural Network Applications · Ferroelectric and Negative Capacitance Devices
MethodsPruning · Q-Learning · Dense Connections · Deep Q-Network · Convolution
