Hidet: Task-Mapping Programming Paradigm for Deep Learning Tensor Programs
Yaoyao Ding, Cody Hao Yu, Bojian Zheng, Yizhi Liu, Yida Wang, Gennady, Pekhimenko

TL;DR
Hidet introduces a task-mapping programming paradigm that embeds scheduling into tensor programs, enabling finer optimization control and automating fusion, significantly improving deep learning inference efficiency and reducing tuning time.
Contribution
The paper proposes a novel task-mapping programming paradigm for deep learning tensor programs, enhancing optimization expressiveness and automating fusion, with a new hardware-centric schedule space.
Findings
Hidet outperforms ONNX Runtime and TVM with AutoTVM and Ansor.
It reduces tuning time by up to 20x.
Experiments on convolution and transformer models show significant speedups.
Abstract
As deep learning models nowadays are widely adopted by both cloud services and edge devices, reducing the latency of deep learning model inferences becomes crucial to provide efficient model serving. However, it is challenging to develop efficient tensor programs for deep learning operators due to the high complexity of modern accelerators and the rapidly growing number of operators. Deep learning compilers, such as Apache TVM, adopt declarative scheduling primitives to lower the bar of developing tensor programs. However, we show that this approach is insufficient to cover state-of-the-art tensor program optimizations. In this paper, we propose to embed the scheduling process into tensor programs and use dedicated mappings, called task mappings, to define the computation assignment and ordering. This new approach greatly enriches the expressible optimizations by allowing developers to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Neural Network Applications · Tensor decomposition and applications
MethodsConvolution
