Accelerating Implicit Finite Difference Schemes Using a Hardware Optimized Tridiagonal Solver for FPGAs
Samuel Palmer

TL;DR
This paper introduces a hardware-optimized FPGA implementation of the Thomas algorithm, significantly reducing computational complexity and latency for solving multiple tridiagonal systems in parallel, with applications in financial derivatives pricing.
Contribution
The paper presents a novel FPGA-based Thomas Core that accelerates tridiagonal system solutions by optimizing data flow and parallelism, reducing complexity and latency compared to traditional methods.
Findings
Reduces complexity from 8N to 5N operations
Halves latency by parallelizing divisions
Enables continuous parallel solving of multiple systems
Abstract
We present a design and implementation of the Thomas algorithm optimized for hardware acceleration on an FPGA, the Thomas Core. The hardware-based algorithm combined with the custom data flow and low level parallelism available in an FPGA reduces the overall complexity from 8N down to 5N serial arithmetic operations, and almost halves the overall latency by parallelizing the two costly divisions. Combining this with a data streaming interface, we reduce memory overheads to 2 N-length vectors per N-tridiagonal system to be solved. The Thomas Core allows for multiple independent tridiagonal systems to be continuously solved in parallel, providing an efficient and scalable accelerator for many numerical computations. Finally we present applications for derivatives pricing problems using implicit finite difference schemes on an FPGA accelerated system and we investigate the use and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPolynomial and algebraic computation · Numerical methods for differential equations · Matrix Theory and Algorithms
