VUDA: Breaking CUDA-Vulkan Isolation for Spatial Sharing of Compute and Graphics on the Same GPU

Bin Xu; Pengfei Hu; Wenxin Zheng; Jinyu Gu; and Haibo Chen

arXiv:2605.01352·cs.OS·May 5, 2026

VUDA: Breaking CUDA-Vulkan Isolation for Spatial Sharing of Compute and Graphics on the Same GPU

Bin Xu, Pengfei Hu, Wenxin Zheng, Jinyu Gu, and Haibo Chen

PDF

TL;DR

VUDA enables concurrent execution of CUDA physics simulation and Vulkan rendering on the same GPU by breaking execution isolation, significantly improving throughput and GPU utilization.

Contribution

This work introduces VUDA, a system that allows spatial sharing of CUDA and Vulkan workloads by unifying execution channels and address spaces without data copying.

Findings

01

Up to 85% higher throughput compared to temporal-sharing baselines

02

Improved GPU utilization and reduced end-to-end latency

03

Enables concurrent physics simulation and rendering on a single GPU

Abstract

GPU-based simulation environments for embodied AI interleave physics simulation (CUDA) and photorealistic rendering (Vulkan) on a single device. We observe that two foundational scenarios -- simulation data generation and RL training -- can be naturally adapted to execute their simulation and rendering phases concurrently, presenting a significant opportunity to improve GPU utilization through spatial multiplexing. However, a fundamental obstacle we term execution isolation prevents this: CUDA and Vulkan create separate GPU contexts whose channels are bound to different scheduling groups, confining compute and graphics to mutually exclusive time slices. Existing spatial-sharing techniques are limited to the CUDA ecosystem, while temporal-sharing approaches underutilize available resources. This paper presents VUDA, a system that breaks execution isolation to enable spatial parallelism…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.