A Massive Data Parallel Computational Framework for Petascale/Exascale Hybrid Computer Systems
Marek Blazewicz, Steven R. Brandt, Peter Diener, David M. Koppelman,, Krzysztof Kurowski, Frank L\"offler, Erik Schnetter, Jian Tao

TL;DR
This paper introduces a new massively data parallel computational framework designed for petascale and exascale hybrid HPC systems, demonstrated through a high-performance 3D CFD code.
Contribution
It presents an alternative framework for developing scalable scientific applications on hybrid systems, improving performance over existing methods.
Findings
Successfully developed a 3D CFD code with improved performance
Demonstrated the framework's scalability on hybrid HPC systems
Provides a new approach for data parallel scientific computing
Abstract
Heterogeneous systems are becoming more common on High Performance Computing (HPC) systems. Even using tools like CUDA and OpenCL it is a non-trivial task to obtain optimal performance on the GPU. Approaches to simplifying this task include Merge (a library based framework for heterogeneous multi-core systems), Zippy (a framework for parallel execution of codes on multiple GPUs), BSGP (a new programming language for general purpose computation on the GPU) and CUDA-lite (an enhancement to CUDA that transforms code based on annotations). In addition, efforts are underway to improve compiler tools for automatic parallelization and optimization of affine loop nests for GPUs and for automatic translation of OpenMP parallelized codes to CUDA. In this paper we present an alternative approach: a new computational framework for the development of massively data parallel scientific codes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Parallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems
