Designing a 3D Parallel Memory-Aware Lattice Boltzmann Algorithm on Manycore Systems
Yuankun Fu, Fengguang Song

TL;DR
This paper presents innovative 3D memory-aware Lattice Boltzmann algorithms optimized for manycore systems, significantly improving performance over existing solutions by reducing memory bottlenecks and synchronization overhead.
Contribution
It introduces novel sequential and parallel 3D memory-aware LBM algorithms that enhance memory access efficiency and thread safety on manycore architectures.
Findings
Up to 89% performance improvement over Palabos software.
Effective reduction of synchronization in parallel LBM.
Enhanced memory access patterns for 3D LBM algorithms.
Abstract
Lattice Boltzmann method (LBM) is a promising approach to solving Computational Fluid Dynamics (CFD) problems, however, its nature of memory-boundness limits nearly all LBM algorithms' performance on modern computer architectures. This paper introduces novel sequential and parallel 3D memory-aware LBM algorithms to optimize its memory access performance. The introduced new algorithms combine the features of single-copy distribution, single sweep, swap algorithm, prism traversal, and merging two temporal time steps. We also design a parallel methodology to guarantee thread safety and reduce synchronizations in the parallel LBM algorithm. At last, we evaluate their performances on three high-end manycore systems and demonstrate that our new 3D memory-aware LBM algorithms outperform the state-of-the-art Palabos software (which realizes the Fuse Swap Prism LBM solver) by up to 89%.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
