Teraflop per second gravitational lensing ray-shooting using graphics   processing units

Alexander C. Thompson; Christopher J. Fluke; David G. Barnes and; Benjamin R. Barsdell

arXiv:0905.2453·astro-ph.IM·September 28, 2009

Teraflop per second gravitational lensing ray-shooting using graphics processing units

Alexander C. Thompson, Christopher J. Fluke, David G. Barnes and, Benjamin R. Barsdell

PDF

TL;DR

This paper demonstrates a highly efficient GPU-based implementation of gravitational lensing ray-shooting, achieving teraflop performance and enabling billion-lens microlensing simulations on a single machine within a day.

Contribution

The authors develop a GPU-accelerated inverse ray-shooting method for gravitational lensing, achieving unprecedented performance and scalability with multiple GPUs.

Findings

01

Single GPU achieves 182 Gflop/s performance.

02

Multi-GPU system reaches 1.28 Tflop/s.

03

Billion-lens microlensing simulations completed in about a day.

Abstract

Gravitational lensing calculation using a direct inverse ray-shooting approach is a computationally expensive way to determine magnification maps, caustic patterns, and light-curves (e.g. as a function of source profile and size). However, as an easily parallelisable calculation, gravitational ray-shooting can be accelerated using programmable graphics processing units (GPUs). We present our implementation of inverse ray-shooting for the NVIDIA G80 generation of graphics processors using the NVIDIA Compute Unified Device Architecture (CUDA) software development kit. We also extend our code to multiple-GPU systems, including a 4-GPU NVIDIA S1070 Tesla unit. We achieve sustained processing performance of 182 Gflop/s on a single GPU, and 1.28 Tflop/s using the Tesla unit. We demonstrate that billion-lens microlensing simulations can be run on a single computer with a Tesla unit in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.