ML-driven Hardware Cost Model for MLIR

Dibyendu Das; Sandya Mannarswamy

arXiv:2302.11405·cs.LG·February 23, 2023

ML-driven Hardware Cost Model for MLIR

Dibyendu Das, Sandya Mannarswamy

PDF

Open Access

TL;DR

This paper introduces a machine learning-based hardware cost model for high-level MLIR, enabling more accurate predictions of hardware utilization and performance metrics to optimize deep learning compilers.

Contribution

The paper develops a novel ML-driven cost model for high-level MLIR, leveraging NLP techniques to improve prediction accuracy over traditional static models.

Findings

01

Models provide reasonably good estimates with low error bounds.

02

Applicable to high-level MLIR from frameworks like PyTorch and TensorFlow.

03

Potential to guide compiler optimizations at graph and kernel levels.

Abstract

During early optimization passes, compilers must make predictions for machine-dependent characteristics such as execution unit utilization, number of register spills, latency, throughput etc. to generate better code. Often a hand-written static/analytical hardware cost model is built into the compiler. However, the need for more sophisticated and varied predictions has become more pronounced with the development of deep learning compilers which need to optimize dataflow graphs. Such compilers usually employ a much higher level MLIR form as an IR representation before lowering to traditional LLVM-IR. A static/analytical cost model in such a scenario is cumbersome and error prone as the opcodes represent very high level algebraic/arithmetic operations. Hence, we develop a machine learning-based cost model for high-level MLIR which can predict different target variables of interest such as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Cloud Computing and Resource Management · Scientific Computing and Data Management