QR Factorization of Tall and Skinny Matrices in a Grid Computing   Environment

Emmanuel Agullo; Camille Coti; Jack Dongarra; Thomas Herault; and; Julien Langou

arXiv:0912.2572·cs.DC·November 15, 2016

QR Factorization of Tall and Skinny Matrices in a Grid Computing Environment

Emmanuel Agullo, Camille Coti, Jack Dongarra, Thomas Herault, and, Julien Langou

PDF

TL;DR

This paper introduces a topology-aware QR factorization method for tall and skinny matrices in grid environments, significantly improving performance over traditional methods by reducing communication bottlenecks.

Contribution

The paper combines a communication-avoiding QR algorithm with topology-aware middleware to enhance distributed performance in grid computing environments.

Findings

01

Performance increases linearly with the number of sites.

02

Method outperforms traditional ScaLAPACK in large-scale problems.

03

Reduces communication bottlenecks in distributed QR computations.

Abstract

Previous studies have reported that common dense linear algebra operations do not achieve speed up by using multiple geographical sites of a computational grid. Because such operations are the building blocks of most scientific applications, conventional supercomputers are still strongly predominant in high-performance computing and the use of grids for speeding up large-scale scientific problems is limited to applications exhibiting parallelism at a higher level. We have identified two performance bottlenecks in the distributed memory algorithms implemented in ScaLAPACK, a state-of-the-art dense linear algebra library. First, because ScaLAPACK assumes a homogeneous communication network, the implementations of ScaLAPACK algorithms lack locality in their communication pattern. Second, the number of messages sent in the ScaLAPACK algorithms is significantly greater than other algorithms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.