Exact Sparse Matrix-Vector Multiplication on GPU's and Multicore   Architectures

Brice Boyer (LJK); Jean-Guillaume Dumas (LJK); Pascal Giorgi (LIRMM)

arXiv:1004.3719·cs.DC·September 9, 2010

Exact Sparse Matrix-Vector Multiplication on GPU's and Multicore Architectures

Brice Boyer (LJK), Jean-Guillaume Dumas (LJK), Pascal Giorgi (LIRMM)

PDF

TL;DR

This paper presents optimized implementations of sparse matrix-vector multiplication on GPUs and multicore CPUs to enhance the performance of algebraic algorithms over finite fields, leveraging parallelization techniques.

Contribution

It introduces new GPU and multicore implementations of sparse matrix-vector multiplication and applies them to improve finite field algebraic algorithms within the LinBox library.

Findings

01

Significant speedup of sparse matrix-vector multiplication on GPU and multicore architectures.

02

Enhanced performance of black box algorithms over finite fields.

03

Parallelization of the sigma-basis algorithm in a block Wiedemann rank implementation.

Abstract

We propose different implementations of the sparse matrix--dense vector multiplication (\spmv{}) for finite fields and rings $\Zb / m \Zb$ . We take advantage of graphic card processors (GPU) and multi-core architectures. Our aim is to improve the speed of \spmv{} in the \linbox library, and henceforth the speed of its black box algorithms. Besides, we use this and a new parallelization of the sigma-basis algorithm in a parallel block Wiedemann rank implementation over finite fields.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.