Explore as a Storm, Exploit as a Raindrop: On the Benefit of Fine-Tuning Kernel Schedulers with Coordinate Descent
Michael Canesche, Gaurav Verma, Fernando Magno Quintao Pereira

TL;DR
This paper introduces a method to improve kernel scheduling efficiency by integrating Droplet Search with coordinate descent into Ansor, reducing search time and enhancing kernel quality across various models and architectures.
Contribution
It presents a novel approach combining Droplet Search and coordinate descent within Ansor, significantly reducing search time while improving kernel quality in deep learning models.
Findings
Faster kernel optimization with better quality than traditional methods.
Effective across multiple architectures and deep learning models.
Validated improvements in Ansor and MetaSchedule frameworks.
Abstract
Machine-learning models consist of kernels, which are algorithms applying operations on tensors -- data indexed by a linear combination of natural numbers. Examples of kernels include convolutions, transpositions, and vectorial products. There are many ways to implement a kernel. These implementations form the kernel's optimization space. Kernel scheduling is the problem of finding the best implementation, given an objective function -- typically execution speed. Kernel optimizers such as Ansor, Halide, and AutoTVM solve this problem via search heuristics, which combine two phases: exploration and exploitation. The first step evaluates many different kernel optimization spaces. The latter tries to improve the best implementations by investigating a kernel within the same space. For example, Ansor combines kernel generation through sketches for exploration and leverages an evolutionary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Simulation Techniques and Applications · Real-Time Systems Scheduling
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Concatenated Skip Connection · Dense Block · Average Pooling · Softmax · Max Pooling · 1x1 Convolution · Dropout · Global Average Pooling
