Runtime Support for Performance Portability on Heterogeneous Distributed Platforms
Polykarpos Thomadakis, Nikos Chrisochoides

TL;DR
This paper presents a runtime framework that simplifies programming for heterogeneous distributed systems, reporting up to 300% performance improvement on a single device and linear scalability on a node equipped with four GPUs.
Contribution
The paper introduces a scalable runtime system that provides portable abstractions for heterogeneous hardware, enabling performance portability across heterogeneous nodes and efficient inter-node communication.
Findings
Up to 300% performance improvement on a single device.
Linear scalability on a node equipped with four GPUs.
Performance gains of up to 20% over MPI+CUDA for large messages.
Abstract
Hardware heterogeneity is here to stay for high-performance computing. Large-scale systems are currently equipped with multiple GPU accelerators per compute node and are expected to incorporate more specialized hardware. This shift in the computing ecosystem offers many opportunities for performance improvement; however, it also increases the complexity of programming for such architectures. This work introduces a runtime framework that enables effortless programming for heterogeneous systems while efficiently utilizing hardware resources. The framework is integrated within a distributed and scalable runtime system to facilitate performance portability across heterogeneous nodes. Along with the design, this paper describes the implementation and optimizations performed, achieving up to 300% improvement on a single device and linear scalability on a node equipped with four GPUs. The…
Taxonomy
Topics: Parallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems · Cloud Computing and Resource Management
