Hierarchical Mixture of Experts: Generalizable Learning for High-Level Synthesis
Weikai Li, Ding Wang, Zijian Ding, Atefeh Sohrabizadeh, Zongyue Qin,, Jason Cong, Yizhou Sun

TL;DR
This paper introduces a hierarchical Mixture of Experts model for high-level synthesis in FPGA design, improving domain generalization and performance prediction across diverse kernels using a multi-level learning approach.
Contribution
It proposes a novel hierarchical MoE structure with a two-stage training method, enhancing domain generalization in performance prediction for FPGA high-level synthesis.
Findings
Hierarchical MoE improves prediction accuracy across new kernels.
Two-stage training stabilizes expert specialization.
Model effectively captures multi-granularity program features.
Abstract
High-level synthesis (HLS) is a widely used tool in designing Field Programmable Gate Array (FPGA). HLS enables FPGA design with software programming languages by compiling the source code into an FPGA circuit. The source code includes a program (called "kernel") and several pragmas that instruct hardware synthesis, such as parallelization, pipeline, etc. While it is relatively easy for software developers to design the program, it heavily relies on hardware knowledge to design the pragmas, posing a big challenge for software developers. Recently, different machine learning algorithms, such as GNNs, have been proposed to automate the pragma design via performance prediction. However, when applying the trained model on new kernels, the significant domain shift often leads to unsatisfactory performance. We propose a more domain-generalizable model structure: a two-level hierarchical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Image Retrieval and Classification Techniques · Domain Adaptation and Few-Shot Learning
MethodsMixture of Experts
