Dynamic Hierarchical Birkhoff-von Neumann Decomposition for All-to-All GPU Communication
Yen-Chieh Wu, Cheng-Shang Chang, Duan-Shin Lee, H. Jonathan Chao

TL;DR
This paper introduces a dynamic hierarchical Birkhoff-von Neumann decomposition method for optimizing all-to-all GPU communication, effectively reducing bottlenecks caused by traffic skew and hierarchical network structures in large-scale GPU clusters.
Contribution
It presents a novel hierarchical BvN decomposition framework tailored for two-tier GPU fabrics, improving scheduling efficiency and stability in all-to-all GPU communication.
Findings
Significant reduction in mean frame length during simulations.
Enhanced stability under Poisson traffic arrivals.
Effective mitigation of traffic skew within servers.
Abstract
All-to-all GPU communication is a critical bottleneck in large-scale training clusters, where completion time is constrained by per-port bandwidth and can be severely impacted by traffic skew across GPUs and network interface cards (NICs). This issue is amplified by the two-tier structure of modern GPU systems, which combine fast intra-server links with much slower inter-server networks. Motivated by recent system observations that highlight the importance of traffic reshaping and hierarchy awareness, we study all-to-all scheduling from an online switching and queueing-theoretic perspective. We propose a dynamic hierarchical Birkhoff--von Neumann (BvN) decomposition framework tailored to two-tier GPU fabrics. At each frame boundary, traffic is first balanced within each server using simple local operations to mitigate micro-level GPU/NIC skew while preserving aggregate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Interconnection Networks and Systems · Cloud Computing and Resource Management
