A Hardware-based HEFT Scheduler Implementation for Dynamic Workloads on Heterogeneous SoCs
Alexander Fusco, Sahil Hassan, Joshua Mack, Ali Akoglu

TL;DR
This paper presents a hardware implementation of the HEFT scheduler on FPGA for heterogeneous SoCs, significantly reducing scheduling latency and increasing task throughput for dynamic workloads.
Contribution
It introduces a hardware-based HEFT_RT scheduler, demonstrating improved performance and latency reduction over software implementations on FPGA platforms.
Findings
Hardware HEFT_RT achieves 9.144 ns scheduling latency.
Processes 26.7% more tasks per second than software.
Reduces scheduling latency by up to 183 times.
Abstract
Non-uniform performance and power consumption across the processing elements (PEs) of heterogeneous SoCs increase the computation complexity of the task scheduling problem compared to homogeneous architectures. Latency of a software-based scheduler with the increased heterogeneity level in terms of number and types of PEs creates the necessity of deploying a scheduler as an overlay processor in hardware to be able to make scheduling decisions rapidly and enable deployment of real-life applications on heterogeneous SoCs. In this study we present the design trade-offs involved for implementing and deploying the runtime variant of the heterogeneous earliest finish time algorithm (HEFT_RT) on the FPGA. We conduct performance evaluations on a SoC configuration emulated over the Xilinx Zynq ZCU102 platform. In a runtime environment we demonstrate hardware-based HEFT_RT's ability to make…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Real-Time Systems Scheduling · Distributed and Parallel Computing Systems
