Efficient Boys function evaluation using minimax approximation
Rasmus Vikhamar-Sandberg, Michal Repisky

TL;DR
This paper introduces a fast, GPU-optimized algorithm for evaluating Boys functions using rational minimax approximations and recurrence relations, avoiding lookup tables and irregular memory access.
Contribution
It develops a novel method combining minimax approximations with recurrence relations tailored for high-throughput hardware architectures.
Findings
Achieves high accuracy with a maximum absolute error of 5e-14.
Provides specific approximation regions and coefficients for F0 to F32.
Optimized for GPU architectures with high throughput and low latency.
Abstract
We present an algorithm for efficient evaluation of Boys functions tailored to modern computing architectures, in particular graphical processing units (GPUs), where maximum throughput is high and data movement is costly. The method combines rational minimax approximations with upward and downward recurrence relations. The non-negative real axis is partitioned into three regions, , where regions and are treated using rational minimax approximations and region by an asymptotic approximation. This formulation avoids lookup tables and irregular memory access, making it well suited hardware with high maximum throughput and low latency. The rational minimax coefficients are generated using the rational Remez algorithm. For a target maximum absolute error of , the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPolynomial and algebraic computation · Numerical Methods and Algorithms · Formal Methods in Verification
