Transferable Graph Optimizers for ML Compilers

Yanqi Zhou; Sudip Roy; Amirali Abdolrashidi; Daniel Wong; Peter Ma,; Qiumin Xu; Hanxiao Liu; Phitchaya Mangpo Phothilimthana; Shen Wang; Anna; Goldie; Azalia Mirhoseini; and James Laudon

arXiv:2010.12438·cs.LG·February 23, 2021

Transferable Graph Optimizers for ML Compilers

Yanqi Zhou, Sudip Roy, Amirali Abdolrashidi, Daniel Wong, Peter Ma,, Qiumin Xu, Hanxiao Liu, Phitchaya Mangpo Phothilimthana, Shen Wang, Anna, Goldie, Azalia Mirhoseini, and James Laudon

PDF

Open Access 1 Video

TL;DR

This paper introduces a transferable deep reinforcement learning approach for graph optimization in ML compilers, achieving significant speedups and improvements over existing methods and human experts across various complex graph tasks.

Contribution

It presents a novel end-to-end, scalable graph neural network-based reinforcement learning method that generalizes to unseen graphs and optimizes entire computational graphs efficiently.

Findings

01

33%-60% speedup on three optimization tasks

02

21% average improvement over human experts

03

18% improvement over prior state-of-the-art methods

Abstract

Most compilers for machine learning (ML) frameworks need to solve many correlated optimization problems to generate efficient machine code. Current ML compilers rely on heuristics based algorithms to solve these optimization problems one at a time. However, this approach is not only hard to maintain but often leads to sub-optimal solutions especially for newer model architectures. Existing learning based approaches in the literature are sample inefficient, tackle a single optimization problem, and do not generalize to unseen graphs making them infeasible to be deployed in practice. To address these limitations, we propose an end-to-end, transferable deep reinforcement learning method for computational graph optimization (GO), based on a scalable sequential attention mechanism over an inductive graph neural network. GO generates decisions on the entire graph rather than on each…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Transferable Graph Optimizers for ML Compilers· slideslive

Taxonomy

TopicsAdvanced Neural Network Applications · Big Data and Digital Economy · Graph Theory and Algorithms

MethodsLinear Layer · Cosine Annealing · Average Pooling · Label Smoothing · 1x1 Convolution · Auxiliary Classifier · Inception-v3 Module · Max Pooling · Variational Dropout · Adam