P3DFFT: a framework for parallel computations of Fourier transforms in three dimensions
Dmitry Pekurovsky

TL;DR
P3DFFT is an efficient, scalable software package for 3D Fourier transforms that overcomes traditional bottlenecks using 2D domain decomposition, enabling high-performance computations on large parallel systems.
Contribution
The paper introduces P3DFFT, a scalable 3D FFT implementation that significantly improves performance on high-core-count systems through innovative domain decomposition techniques.
Findings
Achieves 45% efficiency in weak scaling from 128 to 65,536 cores on Cray XT5
Supports various transform types and data formats with flexible interfaces
Provides open-source software for high-performance 3D Fourier transforms
Abstract
Fourier and related transforms is a family of algorithms widely employed in diverse areas of computational science, notoriously difficult to scale on high-performance parallel computers with large number of processing elements (cores). This paper introduces a popular software package called P3DFFT implementing Fast Fourier Transforms (FFT) in three dimensions (3D) in a highly efficient and scalable way. It overcomes a well-known scalability bottleneck of 3D FFT implementations by using two-dimensional domain decomposition. Designed for portable performance, P3DFFT achieves excellent timings for a number of systems and problem sizes. On Cray XT5 system P3DFFT attains 45% efficiency in weak scaling from 128 to 65,536 computational cores. Library features include Fourier and Chebyshev transforms, Fortran and C interfaces, in- and out-of-place transforms, uneven data grids, single and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
