VUDA: Breaking CUDA-Vulkan Isolation for Spatial Sharing of Compute and Graphics on the Same GPU
Bin Xu, Pengfei Hu, Wenxin Zheng, Jinyu Gu, and Haibo Chen

TL;DR
VUDA enables concurrent execution of CUDA physics simulation and Vulkan rendering on the same GPU by breaking execution isolation, significantly improving throughput and GPU utilization.
Contribution
This work introduces VUDA, a system that allows spatial sharing of CUDA and Vulkan workloads by unifying execution channels and address spaces without data copying.
Findings
Up to 85% higher throughput compared to temporal-sharing baselines
Improved GPU utilization and reduced end-to-end latency
Enables concurrent physics simulation and rendering on a single GPU
Abstract
GPU-based simulation environments for embodied AI interleave physics simulation (CUDA) and photorealistic rendering (Vulkan) on a single device. We observe that two foundational scenarios -- simulation data generation and RL training -- can be naturally adapted to execute their simulation and rendering phases concurrently, presenting a significant opportunity to improve GPU utilization through spatial multiplexing. However, a fundamental obstacle we term execution isolation prevents this: CUDA and Vulkan create separate GPU contexts whose channels are bound to different scheduling groups, confining compute and graphics to mutually exclusive time slices. Existing spatial-sharing techniques are limited to the CUDA ecosystem, while temporal-sharing approaches underutilize available resources. This paper presents VUDA, a system that breaks execution isolation to enable spatial parallelism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
