Performance Optimization and Parallelization of a Parabolic Equation Solver in Computational Ocean Acoustics on Modern Many-core Computer
Min Xu, Yongxian Wang, Anthony Theodore Chronopoulos, Hao Yue

TL;DR
This paper presents a set of performance optimization and parallelization techniques for the FOR3D ocean acoustics solver, achieving significant speedup and scalability on modern many-core HPC systems.
Contribution
It introduces a hybrid OpenMP+MPI parallelization approach and various optimization strategies to enhance FOR3D's performance and extend its application to multi-frequency calculations.
Findings
Achieved 25.77x speedup on Tianhe-2 supercomputer.
Demonstrated good weak scalability of the parallel version.
Extended the application from single-frequency to multi-frequency calculations.
Abstract
As one of open-source codes widely used in computational ocean acoustics, FOR3D can provide a very good estimate for underwater acoustic propagation. In this paper, we propose a performance optimization and parallelization to speed up the running of FOR3D. We utilized a variety of methods to enhance the entire performance, such as using a multi-threaded programming model to exploit the potential capability of the many-core node of high-performance computing (HPC) system, tuning compile options, using efficient tuned mathematical library and utilizing vectorization optimization instruction. In addition, we extended the application from single-frequency calculation to multi-frequency calculation successfully by using OpenMP+MPI hybrid programming techniques on the mainstream HPC platform. A detailed performance evaluation was performed and the results showed that the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
