Equal bi-Vectorized (EBV) method to high performance on GPU
Amirreza Hashemi, Mohsen Lahooti, Ebrahim Shirani

TL;DR
This paper introduces the Equal bi-Vectorized (EBV) method, a GPU-optimized algorithm for LU decomposition that enhances performance through bi-vectorization and vector equalization, suitable for dense and sparse matrices.
Contribution
The paper presents a novel bi-vectorization and equalization approach to improve LU decomposition efficiency on GPUs, applicable to various parallelism schemes and multi-device setups.
Findings
Demonstrates performance improvements over existing methods
Effective for both dense and sparse matrices
Compatible with multiple parallelism strategies
Abstract
Due to importance of reducing of time solution in numerical codes, we propose an algorithm for parallel LU decomposition solver for dense and sparse matrices on GPU. This algorithm is based on first bi-vectorizing a triangular matrices of decomposed coefficient matrix and then equalizing vectors. So we improve performance of LU decomposition on equal contributed scheme on threads. This algorithm also is convenient for other parallelism method and multi devices. Several test cases show advantage of this method over other familiar method.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Matrix Theory and Algorithms · Embedded Systems Design Techniques
