Optimizing Tensor Computation Graphs with Equality Saturation and Monte Carlo Tree Search
Jakob Hartmann, Guoliang He, Eiko Yoneki

TL;DR
This paper introduces a tensor graph rewriting method combining equality saturation and Monte Carlo tree search to optimize neural network inference speed, overcoming memory constraints and phase-ordering issues in graph optimization.
Contribution
It presents a novel tensor graph rewriting approach that uses Monte Carlo tree search to build better intermediate representations for neural network optimization.
Findings
Achieves up to 11% inference speedup over existing methods.
Introduces a new extraction algorithm for fast, accurate runtime estimates.
Addresses memory constraints in equality saturation-based optimization.
Abstract
The real-world effectiveness of deep neural networks often depends on their latency, thereby necessitating optimization techniques that can reduce a model's inference time while preserving its performance. One popular approach is to sequentially rewrite the input computation graph into an equivalent but faster one by replacing individual subgraphs. This approach gives rise to the so-called phase-ordering problem in which the application of one rewrite rule can eliminate the possibility to apply an even better one later on. Recent work has shown that equality saturation, a technique from compiler optimization, can mitigate this issue by first building an intermediate representation (IR) that efficiently stores multiple optimized versions of the input program before extracting the best solution in a second step. In practice, however, memory constraints prevent the IR from capturing all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Parallel Computing and Optimization Techniques · Computational Physics and Python Applications
