TL;DR
This paper enhances the Corrfunc software by implementing AVX512F SIMD kernels, achieving significant speedups in correlation function computations crucial for astrophysics and cosmology.
Contribution
It introduces manually optimized AVX512F SIMD kernels for Corrfunc, significantly improving computational speed over previous implementations and compiler-generated code.
Findings
Up to 4x speedup with AVX512F kernels over compiler code
Approximately 1.6x faster than AVX kernels
Additional 5-10% speedup through pair pruning techniques
Abstract
Correlation functions are widely used in extra-galactic astrophysics to extract insights into how galaxies occupy dark matter halos and in cosmology to place stringent constraints on cosmological parameters. A correlation function fundamentally requires computing pair-wise separations between two sets of points and then computing a histogram of the separations. Corrfunc is an existing open-source, high-performance software package for efficiently computing a multitude of correlation functions. In this paper, we will discuss the SIMD AVX512F kernels within Corrfunc, capable of processing 16 floats or 8 doubles at a time. The latest manually implemented Corrfunc AVX512F kernels show a speedup of up to relative to compiler-generated code for double-precision calculations. The AVX512F kernels show speedup relative to the AVX kernels and compare favorably to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
