GPU acceleration and performance of the particle-beam-dynamics code   Elegant

J.R. King; I.V. Pogorelov; K.M. Amyx; M. Borland; and R. Soliday

arXiv:1710.07350·physics.comp-ph·November 22, 2018·Comput. Phys. Commun.

GPU acceleration and performance of the particle-beam-dynamics code Elegant

J.R. King, I.V. Pogorelov, K.M. Amyx, M. Borland, and R. Soliday

PDF

TL;DR

This paper presents a GPU-accelerated version of the Elegant particle-beam dynamics code, achieving 6-10x performance improvements on supercomputers while maintaining accuracy and ease of maintenance.

Contribution

The paper introduces a GPU framework for Elegant that significantly enhances simulation performance and ensures accuracy, simplifying implementation of core kernel types.

Findings

01

GPU version achieves 6-10x speedup on Titan supercomputer.

02

Framework simplifies implementation of core kernels.

03

Accuracy matches CPU-based results across multiple methods.

Abstract

Elegant is an accelerator physics and particle-beam dynamics code widely used for modeling and design of a variety of high-energy particle accelerators and accelerator-based systems. In this paper we discuss a recently developed version of the code that can take advantage of CUDA-enabled graphics processing units (GPUs) to achieve significantly improved performance for a large class of simulations that are important in practice. The GPU version is largely defined by a framework that simplifies implementations of the fundamental kernel types that are used by Elegant: particle operations, reductions, particle loss, histograms, array convolutions and random number generation. Accelerated performance on the Titan Cray XK-7 supercomputer is approximately 6-10 times better with the GPU than all the CPU cores associated with the same node count. In addition to performance, the maintainability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.