Execution of Compound Multi-Kernel OpenCL Computations in Multi-CPU/Multi-GPU Environments
F\'abio Soldado, Fernando Alexandre, Herv\'e Paulino

TL;DR
This paper explores efficient scheduling and adaptation strategies for executing compound multi-kernel OpenCL computations across heterogeneous multi-CPU/multi-GPU systems, demonstrating performance improvements over GPU-only approaches.
Contribution
It introduces methods for scheduling and adapting multi-kernel OpenCL workloads on heterogeneous systems, addressing hardware configuration and workload variability challenges.
Findings
Performance gains with combined CPU and GPU execution
Effective workload scheduling strategies
Benefits of data-locality optimizations in CPU environments
Abstract
Current computational systems are heterogeneous by nature, featuring a combination of CPUs and GPUs. As the latter are becoming an established platform for high-performance computing, the focus is shifting towards the seamless programming of these hybrid systems as a whole. The distinct nature of the architectural and execution models in place raises several challenges, as the best hardware configuration is behaviour and workload dependent. In this paper, we address the execution of compound, multi-kernel, OpenCL computations in multi-CPU/multi-GPU environments. We address how these computations may be efficiently scheduled onto the target hardware, and how the system may adapt itself to changes in the workload to process and to fluctuations in the CPU's load. An experimental evaluation attests the performance gains obtained by the conjoined use of the CPU and GPU devices, when compared…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
