Faster GPU-based convolutional gridding via thread coarsening

Bruce Merry

arXiv:1605.07023·astro-ph.IM·June 29, 2016·Astron. Comput.

Faster GPU-based convolutional gridding via thread coarsening

Bruce Merry

PDF

1 Repo

TL;DR

This paper introduces a GPU optimization technique called thread coarsening to significantly accelerate convolutional gridding in interferometric imaging, achieving up to 3.2x speedup on certain hardware.

Contribution

It applies thread coarsening to existing GPU algorithms, substantially improving their efficiency for convolutional gridding tasks.

Findings

01

Up to 3.2x performance gain on GTX 980

02

Up to 1.9x performance gain on GTX 980 for quad-polarization

03

Significant gains on Radeon R9 290X

Abstract

Convolutional gridding is a processor-intensive step in interferometric imaging. While it is possible to use graphics processing units (GPUs) to accelerate this operation, existing methods use only a fraction of the available flops. We apply thread coarsening to improve the efficiency of an existing algorithm, and observe performance gains of up to $3.2 \times$ for single-polarization gridding and $1.9 \times$ for quad-polarization gridding on a GeForce GTX 980, and smaller but still significant gains on a Radeon R9 290X.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ska-sa/thread-coarsening-grid-data
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.