Managing Communication Latency-Hiding at Runtime for Parallel   Programming Languages and Libraries

Mads Ruben Burgdorff Kristensen; Brian Vinter

arXiv:1201.3804·cs.DC·May 16, 2013

Managing Communication Latency-Hiding at Runtime for Parallel Programming Languages and Libraries

Mads Ruben Burgdorff Kristensen, Brian Vinter

PDF

TL;DR

This paper presents a runtime model for latency-hiding in parallel programming, enabling interpreted languages to reduce communication wait times significantly through data dependency management and lazy evaluation.

Contribution

It introduces a novel runtime model and heuristic implementation in DistNumPy for automatic latency-hiding in interpreted parallel programs.

Findings

01

Reduced communication wait time by up to 27 times

02

Lowered communication wait from 54% to 2% of total execution time

03

Demonstrated effectiveness on eight benchmark applications

Abstract

This work introduces a runtime model for managing communication with support for latency-hiding. The model enables non-computer science researchers to exploit communication latency-hiding techniques seamlessly. For compiled languages, it is often possible to create efficient schedules for communication, but this is not the case for interpreted languages. By maintaining data dependencies between scheduled operations, it is possible to aggressively initiate communication and lazily evaluate tasks to allow maximal time for the communication to finish before entering a wait state. We implement a heuristic of this model in DistNumPy, an auto-parallelizing version of numerical Python that allows sequential NumPy programs to run on distributed memory architectures. Furthermore, we present performance comparisons for eight benchmarks with and without automatic latency-hiding. The results shows…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.