Efficient Exascale Discretizations: High-Order Finite Element Methods
Tzanio Kolev, Paul Fischer, Misun Min, Jack Dongarra, Jed, Brown, Veselin Dobrev, Tim Warburton, Stanimire Tomov, Mark S., Shephard, Ahmad Abdelfattah, Valeria Barra, Natalie Beams and, Jean-Sylvain Camier, Noel Chalmers, Yohann Dudouit, Ali Karakus and, Ian Karlin

TL;DR
This paper discusses the development of high-order finite element methods optimized for exascale computing architectures, focusing on algorithms that maximize parallelism and minimize data movement for PDE discretizations.
Contribution
It presents the CEED project's efforts in creating next-generation discretization software and algorithms tailored for exascale hardware, emphasizing matrix-free methods and performance optimization.
Findings
Enhanced performance of finite element discretizations on GPU architectures.
Development of scalable, matrix-free linear solvers for large-scale PDEs.
Successful integration with exascale applications and benchmarks.
Abstract
Efficient exploitation of exascale architectures requires rethinking of the numerical algorithms used in many large-scale applications. These architectures favor algorithms that expose ultra fine-grain parallelism and maximize the ratio of floating point operations to energy intensive data movement. One of the few viable approaches to achieve high efficiency in the area of PDE discretizations on unstructured grids is to use matrix-free/partially-assembled high-order finite element methods, since these methods can increase the accuracy and/or lower the computational time due to reduced data motion. In this paper we provide an overview of the research and development activities in the Center for Efficient Exascale Discretizations (CEED), a co-design center in the Exascale Computing Project that is focused on the development of next-generation discretization software and algorithms to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
