Accelerating the computation of FLAPW methods on heterogeneous architectures

Davor Davidović, Diego Fabregat-Traver, Markus Höhnerbach, and Edoardo di Napoli

arXiv:1712.07206·cs.DC·March 18, 2022

TL;DR

This paper shows how re-engineering the legacy FLEUR code lets it run efficiently on heterogeneous architectures with GPUs and Xeon Phis, reaching over 70% of peak performance and a 5x speedup on the JURECA supercomputer.

Contribution

The paper presents a modular redesign of FLEUR that allows it to exploit heterogeneous architectures effectively, surpassing vendor libraries in performance.

Findings

1. Achieves over 70% of the architectures' peak performance.

2. Outperforms Nvidia's and Intel's libraries.

3. Attains a 5x speedup on the JURECA supercomputer.

Abstract

Legacy codes in computational science and engineering have been very successful in providing essential functionality to researchers. However, they are not capable of exploiting the massive parallelism provided by emerging heterogeneous architectures. The lack of portable performance and scalability puts them at high risk: either they evolve or they are doomed to disappear. One example of legacy code which would heavily benefit from a modern design is FLEUR, a software for electronic structure calculations. In previous work, the computational bottleneck of FLEUR was partially re-engineered to have a modular design that relies on standard building blocks, namely BLAS and LAPACK. In this paper, we demonstrate how the initial redesign enables the portability to heterogeneous architectures. More specifically, we study different approaches to port the code to architectures consisting of…

Code & Models

Repositories

SimLabQuantumMaterials/HybridHSDLA