A Server-based Approach for Predictable GPU Access with Improved   Analysis

Hyoseung Kim; Pratyush Patel; Shige Wang; and Ragunathan (Raj); Rajkumar

arXiv:1709.06613·cs.DC·May 14, 2018·1 cites

A Server-based Approach for Predictable GPU Access with Improved Analysis

Hyoseung Kim, Pratyush Patel, Shige Wang, and Ragunathan (Raj), Rajkumar

PDF

Open Access

TL;DR

This paper introduces a server-based GPU management approach that ensures predictable access, reduces busy waiting and priority inversion, and improves task schedulability in real-time systems, demonstrated through a prototype on an embedded platform.

Contribution

The paper presents a novel server-based GPU management method that improves predictability and efficiency over traditional synchronization approaches in real-time systems.

Findings

01

Significant improvement in task schedulability with the server-based approach

02

Reduction in busy waiting and priority inversion issues

03

Practical implementation demonstrating effectiveness on embedded hardware

Abstract

We propose a server-based approach to manage a general-purpose graphics processing unit (GPU) in a predictable and efficient manner. Our proposed approach introduces a GPU server that is a dedicated task to handle GPU requests from other tasks on their behalf. The GPU server ensures bounded time to access the GPU, and allows other tasks to suspend during their GPU computation to save CPU cycles. By doing so, we address the two major limitations of the existing real-time synchronization-based GPU management approach: busy waiting and long priority inversion. We have implemented a prototype of the server-based approach on a real embedded platform. This case study demonstrates the practicality and effectiveness of the server-based approach. Experimental results indicate that the server-based approach yields significant improvements in task schedulability over the existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Real-Time Systems Scheduling · Distributed and Parallel Computing Systems