TL;DR
This paper presents optimizations for the EngineCL runtime to improve co-execution performance on commodity heterogeneous systems, focusing on reducing overheads and enhancing load balancing under time constraints.
Contribution
It introduces specific optimizations and an improved load balancing algorithm that significantly enhance performance and efficiency in heterogeneous system co-execution scenarios.
Findings
Optimization improvements of 7.5% and 17.4% for different offloading modes.
The new load balancing algorithm achieves an average efficiency of 0.84.
Optimizations reduce performance penalties in time-constrained co-execution.
Abstract
Heterogeneous systems are present from powerful supercomputers, to mobile devices, including desktop computers, thanks to their excellent performance and energy consumption. The ubiquity of these architectures in both desktop systems and medium-sized service servers allow enough variability to exploit a wide range of problems, such as multimedia workloads, video encoding, image filtering and inference in machine learning. Due to the heterogeneity, some efforts have been done to reduce the programming effort and preserve performance portability, but these systems include a set of challenges. The context in which applications offload the workload along with the management overheads introduced when doing co-execution, penalize the performance gains under time-constrained scenarios. Therefore, this paper proposes optimizations for the EngineCL runtime to reduce the penalization when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
