OLLIE: Derivation-based Tensor Program Optimizer
Liyan Zheng, Haojie Wang, Jidong Zhai, Muyan Hu, Zixuan Ma, Tuowei, Wang, Shizhi Tang, Lei Xie, Kezhao Huang, Zhihao Jia

TL;DR
OLLIE is a novel derivation-based tensor program optimizer that explores a broader expression space for DNNs, leading to significant performance improvements over existing methods.
Contribution
OLLIE introduces a derivation-based approach that enables optimization over general tensor algebra expressions, expanding the search space beyond fixed operator sets.
Findings
Up to 2.73× performance improvement on A100 GPU
Average 1.46× speedup across tested DNNs
Effective optimization on multiple GPU architectures
Abstract
Boosting the runtime performance of deep neural networks (DNNs) is critical due to their wide adoption in real-world tasks. Existing approaches to optimizing the tensor algebra expression of a DNN only consider expressions representable by a fixed set of predefined operators, missing possible optimization opportunities between general expressions. We propose OLLIE, the first derivation-based tensor program optimizer. OLLIE optimizes tensor programs by leveraging transformations between general tensor algebra expressions, enabling a significantly larger expression search space that includes those supported by prior work as special cases. OLLIE uses a hybrid derivation-based optimizer that effectively combines explorative and guided derivations to quickly discover highly optimized expressions. Evaluation on seven DNNs shows that OLLIE can outperform existing optimizers by up to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Tensor decomposition and applications · Advanced Neural Network Applications
