An investigation of GPU-based stiff chemical kinetics integration methods
Nicholas J. Curtis, Kyle E. Niemeyer, and Chih-Jen Sung

TL;DR
This paper evaluates GPU-based stiff chemical kinetics integration methods, comparing their performance to CPU-based methods, and discusses the impact of Jacobian computation techniques and hardware limitations on efficiency.
Contribution
It introduces GPU implementations of implicit and exponential integration methods with analytical Jacobians, analyzing their performance and identifying key bottlenecks.
Findings
GPU implicit Runge-Kutta matches 12-38 CPU cores in speed.
Thread divergence and memory traffic limit GPU performance.
Analytical Jacobians significantly outperform finite-difference Jacobians on GPU.
Abstract
A fifth-order implicit Runge-Kutta method and two fourth-order exponential integration methods equipped with Krylov subspace approximations were implemented for the GPU and paired with the analytical chemical kinetic Jacobian software pyJac. The performance of each algorithm was evaluated by integrating thermochemical state data sampled from stochastic partially stirred reactor simulations and compared with the commonly used CPU-based implicit integrator CVODE. We estimated that the implicit Runge-Kutta method running on a single GPU is equivalent to CVODE running on 12-38 CPU cores for integration of a single global integration time step of 1e-6 s with hydrogen and methane models. In the stiffest case studied---the methane model with a global integration time step of 1e-4 s---thread divergence and higher memory traffic significantly decreased GPU performance to the equivalent of CVODE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
