Contention-Aware GPU Partitioning and Task-to-Partition Allocation for   Real-Time Workloads

Houssam-Eddine Zahaf; Ignacio Sanudo Olmedo; Jayati Singh; Nicola; Capodieci; Sebastien Faucou

arXiv:2105.10312·cs.DC·May 24, 2021

Contention-Aware GPU Partitioning and Task-to-Partition Allocation for Real-Time Workloads

Houssam-Eddine Zahaf, Ignacio Sanudo Olmedo, Jayati Singh, Nicola, Capodieci, Sebastien Faucou

PDF

TL;DR

This paper introduces contention-aware GPU partitioning and task allocation techniques for real-time workloads, improving scheduling efficiency by considering kernel interference in multi-cluster GPU architectures.

Contribution

It proposes novel GPU partitioning and task-to-partition allocation methods that account for kernel interference, enhancing real-time scheduling on modern GPU architectures.

Findings

01

Proposed techniques outperform classical non-partitioned approaches

02

Effective interference quantification improves scheduling accuracy

03

Partitioning increases GPU utilization for real-time tasks

Abstract

In order to satisfy timing constraints, modern real-time applications require massively parallel accelerators such as General Purpose Graphic Processing Units (GPGPUs). Generation after generation, the number of computing clusters made available in novel GPU architectures is steadily increasing, hence, investigating suitable scheduling approaches is now mandatory. Such scheduling approaches are related to mapping different and concurrent compute kernels within the GPU computing clusters, hence grouping GPU computing clusters into schedulable partitions. In this paper we propose novel techniques to define GPU partitions; this allows us to define suitable task-to-partition allocation mechanisms in which tasks are GPU compute kernels featuring different timing requirements. Such mechanisms will take into account the interference that GPU kernels experience when running in overlapping time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.