Accelerating finite-rate chemical kinetics with coprocessors: comparing vectorization methods on GPUs, MICs, and CPUs
Christopher P. Stone, Andrew T. Alferman, Kyle E. Niemeyer

TL;DR
This paper compares vectorization methods on GPUs, MICs, and CPUs for accelerating chemical kinetics ODE solvers, revealing significant performance gains with SIMD and highlighting divergence issues affecting GPU performance.
Contribution
It provides a comprehensive performance comparison of SIMD and SIMT paradigms across multiple hardware platforms for chemical kinetics ODE integration.
Findings
SIMD on CPU and MIC achieved 2.5-2.7x speedup over baseline
GPU SIMT was 1.4-1.6x faster than baseline but slower than SIMD on CPU/MIC
Thread divergence due to adaptive step-sizes impacts GPU performance
Abstract
Efficient ordinary differential equation solvers for chemical kinetics must take into account the available thread and instruction-level parallelism of the underlying hardware, especially on many-core coprocessors, as well as the numerical efficiency. A stiff Rosenbrock and nonstiff Runge-Kutta solver are implemented using the single instruction, multiple thread (SIMT) and single instruction, multiple data (SIMD) paradigms with OpenCL. The performances of these parallel implementations were measured with three chemical kinetic models across several multicore and many-core platforms. Two runtime benchmarks were conducted to clearly determine any performance advantage offered by either method: evaluating the right-hand-side source terms in parallel, and integrating a series of constant-pressure homogeneous reactors using the Rosenbrock and Runge-Kutta solvers. The right-hand-side…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
