pylspack: Parallel algorithms and data structures for sketching, column subset selection, regression and leverage scores
Aleksandros Sobczyk, Efstratios Gallopoulos

TL;DR
This paper introduces parallel algorithms and data structures for key linear algebra operations on tall-and-skinny matrices, improving scalability and performance in applications like regression and leverage scores computation.
Contribution
It presents novel parallel algorithms for sketching, matrix computations, and leverage scores, with a focus on efficient implementation for tall-and-skinny matrices.
Findings
Algorithms scale well with matrix size
Significant performance improvements over existing libraries
Effective handling of tall-and-skinny matrices
Abstract
We present parallel algorithms and data structures for three fundamental operations in Numerical Linear Algebra: (i) Gaussian and CountSketch random projections and their combination, (ii) computation of the Gram matrix and (iii) computation of the squared row norms of the product of two matrices, with a special focus on "tall-and-skinny" matrices, which arise in many applications. We provide a detailed analysis of the ubiquitous CountSketch transform and its combination with Gaussian random projections, accounting for memory requirements, computational complexity and workload balancing. We also demonstrate how these results can be applied to column subset selection, least squares regression and leverage scores computation. These tools have been implemented in pylspack, a publicly available Python package (https://github.com/IBM/pylspack) whose core is written in C++ and parallelized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Blind Source Separation Techniques · Face and Expression Recognition
