High Throughput Multidimensional Tridiagonal Systems Solvers on FPGAs
Kamalavasan Kamalakkannan, Istvan Z. Reguly, Suhaib A. Fahmy, Gihan R., Mudalige

TL;DR
This paper introduces a high-performance FPGA library for solving multi-dimensional tridiagonal systems, achieving significant speed and energy efficiency improvements over existing solutions and providing a comprehensive design space exploration.
Contribution
The paper presents a novel FPGA-based solver library with optimizations for multi-dimensional systems, including an analytic performance model and real-world application demonstrations.
Findings
Order of magnitude performance improvement over Xilinx's current library
Achieves over 85% predictive model accuracy in applications
FPGA solutions outperform GPU in runtime and energy efficiency
Abstract
We present a design space exploration for synthesizing optimized, high-throughput implementations of multiple multi-dimensional tridiagonal system solvers on FPGAs. Re-evaluating the characteristics of algorithms for the direct solution of tridiagonal systems, we develop a new tridiagonal solver library aimed at implementing high-performance computing applications on Xilinx FPGA hardware. Key new features of the library are (1) the unification of standard state-of-the-art techniques for implementing implicit numerical solvers with a number of novel high-gain optimizations such as vectorization and batching, motivated by multi-dimensional systems in real-world applications, (2) data-flow techniques that provide application specific optimizations for both 2D and 3D problems, including integration of explicit loops commonplace in real workloads, and (3) the development of an analytic model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Matrix Theory and Algorithms · VLSI and FPGA Design Techniques
