A Structure-Aware Framework for Learning Device Placements on Computation Graphs
Shukai Duan, Heng Ping, Nikos Kanakaris, Xiongye Xiao, Panagiotis, Kyriakis, Nesreen K. Ahmed, Peiyu Zhang, Guixiang Ma, Mihai Capota, Shahin, Nazarian, Theodore L. Willke, Paul Bogdan

TL;DR
This paper introduces a novel structure-aware framework for device placement on computation graphs, improving neural network inference speed by up to 60%, by combining graph coarsening, representation learning, and reinforcement learning.
Contribution
It bridges encoder-placer and grouper-placer methods, proposing a flexible, end-to-end trainable framework that considers the DAG structure for optimized device placement.
Findings
Achieves up to 58.2% speedup on CPU inference.
Demonstrates effectiveness on models like Inception-V3, ResNet, and BERT.
Shows robustness through ablation studies.
Abstract
Computation graphs are Directed Acyclic Graphs (DAGs) where the nodes correspond to mathematical operations and are used widely as abstractions in optimizations of neural networks. The device placement problem aims to identify optimal allocations of those nodes to a set of (potentially heterogeneous) devices. Existing approaches rely on two types of architectures known as grouper-placer and encoder-placer, respectively. In this work, we bridge the gap between encoder-placer and grouper-placer techniques and propose a novel framework for the task of device placement, relying on smaller computation graphs extracted from the OpenVINO toolkit. The framework consists of five steps, including graph coarsening, node representation learning and policy optimization. It facilitates end-to-end training and takes into account the DAG nature of the computation graphs. We also propose a model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsUsability and User Interface Design · Innovative Teaching and Learning Methods · Multimedia Communication and Technology
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · WordPiece · Linear Warmup With Linear Decay · Attention Dropout · Linear Layer · Convolution · Multi-Head Attention · Residual Connection
