SGPRS: Seamless GPU Partitioning Real-Time Scheduler for Periodic Deep   Learning Workloads

Amir Fakhim Babaei; Thidapat Chantem

arXiv:2406.09425·cs.DC·June 17, 2024

SGPRS: Seamless GPU Partitioning Real-Time Scheduler for Periodic Deep Learning Workloads

Amir Fakhim Babaei, Thidapat Chantem

PDF

Open Access

TL;DR

This paper introduces SGPRS, a novel real-time GPU scheduler that enables seamless partition switching for deep learning workloads, improving deadline adherence and overall GPU utilization.

Contribution

SGPRS is the first scheduler to consider zero-configuration partition switching in real-time GPU scheduling for deep neural network workloads.

Findings

01

SGPRS improves deadline satisfaction for parallel GPU tasks.

02

It sustains performance beyond the typical resource pivot point.

03

The scheduler enables more efficient GPU utilization in real-time DNN applications.

Abstract

Deep Neural Networks (DNNs) are useful in many applications, including transportation, healthcare, and speech recognition. Despite various efforts to improve accuracy, few works have studied DNN in the context of real-time requirements. Coarse resource allocation and sequential execution in existing frameworks result in underutilization. In this work, we conduct GPU speedup gain analysis and propose SGPRS, the first real-time GPU scheduler considering zero configuration partition switch. The proposed scheduler not only meets more deadlines for parallel tasks but also sustains overall performance beyond the pivot point.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Embedded Systems Design Techniques · Digital Image Processing Techniques