Benchmarking a Proof-of-Concept Performance Portable SYCL-based Fast Fourier Transformation Library
Vincent R. Pascuzzi, Mehdi Goli

TL;DR
This paper introduces an early version of a SYCL-based FFT library designed for cross-platform compatibility across major hardware vendors, emphasizing portability and ease of development, with initial performance evaluations on various architectures.
Contribution
It presents a single-source, portable FFT library using SYCL, reducing complexity compared to existing solutions, and evaluates its performance across multiple hardware platforms.
Findings
Supports all major vendor hardware including CPUs and GPUs.
Demonstrates comparable performance to vendor-specific FFT libraries.
Identifies performance bottlenecks and areas for future improvement.
Abstract
In this paper, we present an early version of a SYCL-based FFT library, capable of running on all major vendor hardware, including CPUs and GPUs from AMD, ARM, Intel and NVIDIA. Although preliminary, the aim of this work is to seed further developments for a rich set of features for calculating FFTs. It has the advantage over existing portable FFT libraries in that it is single-source, and therefore removes the complexities that arise due to abundant use of pre-process macros and auto-generated kernels to target different architectures. We exercise two SYCL-enabled compilers, Codeplay ComputeCpp and Intel's open-source LLVM project, to evaluate performance portability of our SYCL-based FFT on various heterogeneous architectures. The current limitations of our library is it supports single-dimension FFTs up to in length and base-2 input sequences. We compare our results with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
