Improving Multi-Instance GPU Efficiency via Sub-Entry Sharing TLB Design
Bingyao Li, Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer, Jaleel, Xulong Tang

TL;DR
This paper introduces STAR, a dynamic TLB sharing scheme for NVIDIA's MIG that significantly enhances address translation efficiency and application performance by optimizing TLB sub-entry utilization.
Contribution
STAR is a novel dynamic sharing mechanism for TLB sub-entries that improves multi-tenant GPU performance by reducing interference and increasing TLB utilization.
Findings
STAR achieves an average of 30.2% performance improvement.
It effectively reduces TLB contention among co-running applications.
The method dynamically adapts TLB sharing based on workload demands.
Abstract
NVIDIA's Multi-Instance GPU (MIG) technology enables partitioning GPU computing power and memory into separate hardware instances, providing complete isolation including compute resources, caches, and memory. However, prior work identifies that MIG does not extend to partitioning the last-level TLB (i.e., L3 TLB), which remains shared among all instances. To enhance TLB reach, NVIDIA GPUs reorganized the TLB structure with 16 sub-entries in each L3 TLB entry that have a one-to-one mapping to the address translations for 16 pages of size 64KB located within the same 1MB aligned range. Our comprehensive investigation of address translation efficiency in MIG identifies two main issues caused by L3 TLB sharing interference: (i) it results in performance degradation for co-running applications, and (ii) TLB sub-entries are not fully utilized before eviction. Based on this observation, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVLSI and Analog Circuit Testing · Advancements in Photolithography Techniques · Semiconductor materials and devices
