Assembly of FETI dual operator using CUDA
Jakub Homola (1), Radim Vavřík (1), Ondřej Meca (1), Tomáš Brzobohatý (1), Lubomír Říha (1) ((1) IT4Innovations, VSB - Technical University of Ostrava)

TL;DR
This paper presents a GPU-accelerated method for assembling and explicitly applying the FETI dual operator, making preprocessing up to 10x faster and dual operator application up to 25x faster than on the CPU.
Contribution
The paper introduces GPU-based assembly of the FETI dual operator, so that the explicit approach pays off after fewer solver iterations and delivers substantial speedups.
Findings
Up to 10x faster preprocessing on GPU
Up to 25x faster dual operator application
Fewer iterations needed before the explicit approach pays off
Abstract
FETI is a numerical method used to solve engineering problems. It builds on the ideas of domain decomposition, which makes it highly scalable and capable of efficiently utilizing whole supercomputers. One of the most time-consuming parts of the FETI solver is the application of the dual operator F in every iteration of the solver. It is traditionally performed on the CPU using an implicit approach of applying the individual sparse matrices that form F right-to-left. Another approach is to apply the dual operator explicitly, which primarily involves a simple dense matrix-vector multiplication and can be efficiently performed on the GPU. However, this requires additional preprocessing on the CPU where the dense matrix is assembled, which makes the explicit approach beneficial only after hundreds of iterations are performed. In this paper, we use the GPU to accelerate the assembly…
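The difference between the implicit and explicit approaches described in the abstract can be illustrated with a minimal sketch. It assumes the standard FETI form of the dual operator, F = B K^+ B^T, which is not spelled out in the excerpt above, and it replaces the generalized inverse K^+ with an ordinary inverse of a small invertible K. The matrices are tiny dense stand-ins for the sparse subdomain matrices, everything runs on the CPU in plain Python, and all function names are hypothetical. The last helper shows the break-even arithmetic behind the statement that explicit application only pays off after enough iterations.

```python
import math

# Sketch only: assumes F = B * K^{-1} * B^T with an invertible K.
# A real FETI solver uses a generalized inverse K^+ and sparse factorizations.

def matvec(A, x):
    """Dense matrix-vector product A @ x."""
    return [sum(a * b for a, b in zip(row, x)) for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def solve(K, b):
    """Solve K y = b by Gaussian elimination with partial pivoting."""
    n = len(K)
    M = [row[:] + [rhs] for row, rhs in zip(K, b)]  # augmented matrix
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    y = [0.0] * n
    for r in reversed(range(n)):
        s = sum(M[r][k] * y[k] for k in range(r + 1, n))
        y[r] = (M[r][n] - s) / M[r][r]
    return y

def apply_implicit(B, K, x):
    """Implicit approach: y = B (K^{-1} (B^T x)), evaluated right-to-left."""
    return matvec(B, solve(K, matvec(transpose(B), x)))

def assemble_explicit(B, K):
    """Preprocessing: build the dense F one column at a time from unit vectors."""
    m = len(B)
    cols = []
    for j in range(m):
        e = [1.0 if i == j else 0.0 for i in range(m)]
        cols.append(apply_implicit(B, K, e))  # column j of F
    return transpose(cols)

def break_even_iterations(t_assembly, t_implicit, t_explicit):
    """Smallest iteration count at which the assembly cost is recovered.

    Returns None when the explicit application is not faster per iteration.
    """
    saving = t_implicit - t_explicit
    if saving <= 0:
        return None
    return math.ceil(t_assembly / saving)
```

After `F = assemble_explicit(B, K)`, each solver iteration reduces to `matvec(F, x)`, which is the dense matrix-vector product the abstract describes as well suited to the GPU. Calling `break_even_iterations` with measured timings shows why a cheaper assembly lowers the iteration count at which the explicit approach becomes worthwhile.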
