Zorua: Enhancing Programming Ease, Portability, and Performance in GPUs by Decoupling Programming Models from Resource Management
Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, Samira Khan,, Ashish Shrestha, Saugata Ghose, Phillip B. Gibbons, Onur Mutlu

TL;DR
Zorua is a GPU resource virtualization framework that decouples software-specified resource parameters from hardware allocation, improving programming ease, portability, and performance by dynamically managing resources.
Contribution
This work introduces Zorua, a novel resource virtualization framework that separates resource specification from hardware management in GPUs, enhancing usability and efficiency.
Findings
Zorua improves programming ease by reducing resource tuning efforts.
Zorua enhances portability across GPU generations without re-tuning.
Zorua maintains or boosts application performance through dynamic resource management.
Abstract
The application resource specification--a static specification of several parameters such as the number of threads and the scratchpad memory usage per thread block--forms a critical component of the existing GPU programming models. This specification determines the performance of the application during execution because the corresponding on-chip hardware resources are allocated and managed purely based on this specification. This tight coupling between the software-provided resource specification and resource management in hardware leads to significant challenges in programming ease, portability, and performance, as we demonstrate in this work. Our goal in this work is to reduce the dependence of performance on the software-provided resource specification to simultaneously alleviate the above challenges. To this end, we introduce Zorua, a new resource virtualization framework, that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Cloud Computing and Resource Management · Advanced Data Storage Technologies
