Rectangular Full Packed Format for Cholesky's Algorithm: Factorization, Solution and Inversion
Fred G. Gustavson, Jerzy Wasniewski, Jack J. Dongarra, Julien, Langou

TL;DR
This paper introduces the Rectangular Full Packed Format (RFPF), a new data format that combines the storage efficiency of packed format with the high performance of full format, enabling efficient Cholesky-related computations without new software.
Contribution
The paper presents RFPF, a novel data format that achieves high performance and minimal storage for symmetric and Hermitian matrices, compatible with existing LAPACK routines and Level 3 BLAS.
Findings
RFPF achieves comparable performance to full format with half the storage.
RFPF significantly outperforms packed routines, up to 97 times faster on SMP parallel systems.
RFPF simplifies implementation by requiring no new software, only routine adaptations.
Abstract
We describe a new data format for storing triangular, symmetric, and Hermitian matrices called RFPF (Rectangular Full Packed Format). The standard two dimensional arrays of Fortran and C (also known as full format) that are used to represent triangular and symmetric matrices waste nearly half of the storage space but provide high performance via the use of Level 3 BLAS. Standard packed format arrays fully utilize storage (array space) but provide low performance as there is no Level 3 packed BLAS. We combine the good features of packed and full storage using RFPF to obtain high performance via using Level 3 BLAS as RFPF is a standard full format representation. Also, RFPF requires exactly the same minimal storage as packed format. Each LAPACK full and/or packed triangular, symmetric, and Hermitian routine becomes a single new RFPF routine based on eight possible data layouts of RFPF.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Numerical Methods and Algorithms · Parallel Computing and Optimization Techniques
