Parallel FFTW on RISC-V: A Comparative Study including OpenMP, MPI, and HPX
Alexander Strack, Christopher Taylor, and Dirk Pfl\"uger

TL;DR
This study evaluates the parallel performance of FFTW on RISC-V processors using MPI, OpenMP, and HPX, comparing it to x86-64, and highlights the challenges and potential of RISC-V for large-scale parallel computing.
Contribution
It provides a comparative analysis of FFTW's parallel scaling on RISC-V versus x86-64, including memory optimization effects and the performance of different parallelization strategies.
Findings
FFT performance on RISC-V is about one-eighth of x86-64 for double-precision 2D FFT.
MPI scaling is effective up to 64 cores on both architectures.
OpenMP scaling requires specific planning for optimal performance on RISC-V.
Abstract
Rapid advancements in RISC-V hardware development shift the focus from low-level optimizations to higher-level parallelization. Recent RISC-V processors, such as the SOPHON SG2042, have 64 cores. RISC-V processors with core counts comparable to the SG2042, make efficient parallelization as crucial for RISC-V as the more established processors such as x86-64. In this work, we evaluate the parallel scaling of the widely used FFTW library on RISC-V for MPI and OpenMP. We compare it to a 64-core AMD EPYC 7742 CPU side by side for different types of FFTW planning. Additionally, we investigate the effect of memory optimization on RISC-V in HPX-FFT, a parallel FFT library based on the asynchronous many-task runtime HPX using an FFTW backend. We generally observe a performance delta between the x86-64 and RISC-V chips of factor eight for double-precision 2D FFT. Effective memory optimizations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Digital Filter Design and Implementation · Computational Physics and Python Applications
