Transferable Graph Optimizers for ML Compilers
Yanqi Zhou, Sudip Roy, Amirali Abdolrashidi, Daniel Wong, Peter Ma,, Qiumin Xu, Hanxiao Liu, Phitchaya Mangpo Phothilimthana, Shen Wang, Anna, Goldie, Azalia Mirhoseini, and James Laudon

TL;DR
This paper introduces a transferable deep reinforcement learning approach for graph optimization in ML compilers, achieving significant speedups and improvements over existing methods and human experts across various complex graph tasks.
Contribution
It presents a novel end-to-end, scalable graph neural network-based reinforcement learning method that generalizes to unseen graphs and optimizes entire computational graphs efficiently.
Findings
33%-60% speedup on three optimization tasks
21% average improvement over human experts
18% improvement over prior state-of-the-art methods
Abstract
Most compilers for machine learning (ML) frameworks need to solve many correlated optimization problems to generate efficient machine code. Current ML compilers rely on heuristics based algorithms to solve these optimization problems one at a time. However, this approach is not only hard to maintain but often leads to sub-optimal solutions especially for newer model architectures. Existing learning based approaches in the literature are sample inefficient, tackle a single optimization problem, and do not generalize to unseen graphs making them infeasible to be deployed in practice. To address these limitations, we propose an end-to-end, transferable deep reinforcement learning method for computational graph optimization (GO), based on a scalable sequential attention mechanism over an inductive graph neural network. GO generates decisions on the entire graph rather than on each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Big Data and Digital Economy · Graph Theory and Algorithms
MethodsLinear Layer · Cosine Annealing · Average Pooling · Label Smoothing · 1x1 Convolution · Auxiliary Classifier · Inception-v3 Module · Max Pooling · Variational Dropout · Adam
