A decomposition method with minimum communication amount for parallelization of multi-dimensional FFTs
Truong Vinh Truong Duy, Taisuke Ozaki

TL;DR
This paper introduces an adaptive decomposition method for multi-dimensional FFTs that minimizes communication costs across various process counts by intelligently choosing transpose orders and data partitioning strategies.
Contribution
The proposed method adaptively decomposes multi-dimensional FFT data and optimizes transpose order to reduce communication, outperforming previous fixed-dimension approaches.
Findings
Achieves minimal communication for all process ranges
Identifies optimal transpose orders for 3D, 4D, and 5D FFTs
Enhances scalability and efficiency of parallel FFT implementations
Abstract
The fast Fourier transform (FFT) is undoubtedly an essential primitive that has been applied in various fields of science and engineering. In this paper, we present a decomposition method for parallelization of multi-dimensional FFTs with smallest communication amount for all ranges of the number of processes compared to previously proposed methods. This is achieved by two distinguishing features: adaptive decomposition and transpose order awareness. In the proposed method, the FFT data are decomposed based on a row-wise basis that maps the multi-dimensional data into one-dimensional data, and translates the corresponding coordinates from multi-dimensions into one-dimension so that the resultant one-dimensional data can be divided and allocated equally to the processes. As a result, differently from previous works that have the dimensions of decomposition pre-defined, our method can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical Methods and Algorithms · Parallel Computing and Optimization Techniques · Digital Filter Design and Implementation
