Decoupling GPU Programming Models from Resource Management for Enhanced Programming Ease, Portability, and Performance
Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, Samira Khan,, Ashish Shrestha, Saugata Ghose, Adwait Jog, Phillip B. Gibbons, Onur Mutlu

TL;DR
Zorua introduces a resource virtualization framework that decouples GPU resource specifications from hardware allocation, enhancing programming ease, portability, and performance through transparent virtualization and dynamic resource management.
Contribution
This paper presents Zorua, a novel framework that virtualizes GPU resources to decouple software specifications from hardware management, improving ease of programming, portability, and performance.
Findings
Zorua enables resource virtualization, providing the illusion of more resources than physically available.
It improves programming ease by reducing the need for resource tuning.
It enhances portability by minimizing re-tuning across GPU generations.
Abstract
The application resource specification--a static specification of several parameters such as the number of threads and the scratchpad memory usage per thread block--forms a critical component of modern GPU programming models. This specification determines the parallelism, and hence performance, of the application during execution because the corresponding on-chip hardware resources are allocated and managed based on this specification. This tight-coupling between the software-provided resource specification and resource management in hardware leads to significant challenges in programming ease, portability, and performance. Zorua is a new resource virtualization framework, that decouples the programmer-specified resource usage of a GPU application from the actual allocation in the on-chip hardware resources. Zorua enables this decoupling by virtualizing each resource transparently to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Cloud Computing and Resource Management · Distributed and Parallel Computing Systems
