Bigger Buffer k-d Trees on Multi-Many-Core Systems

Fabian Gieseke; Cosmin Eugen Oancea; Ashish Mahabal and; Christian Igel; Tom Heskes

arXiv:1512.02831·cs.DC·December 10, 2015

Bigger Buffer k-d Trees on Multi-Many-Core Systems

Fabian Gieseke, Cosmin Eugen Oancea, Ashish Mahabal and, Christian Igel, Tom Heskes

PDF

1 Repo

TL;DR

This paper extends buffer k-d trees to handle massive datasets across multiple devices, enabling efficient parallel nearest neighbor searches on large-scale data in astronomy.

Contribution

The authors modify buffer k-d trees and their workflow to support multi-device environments, facilitating scalable nearest neighbor searches on very large datasets.

Findings

01

Effective multi-device buffer k-d tree framework demonstrated

02

Significant speed-ups in astronomy data processing

03

Scalable approach for massive data sets

Abstract

A buffer k-d tree is a k-d tree variant for massively-parallel nearest neighbor search. While providing valuable speed-ups on modern many-core devices in case both a large number of reference and query points are given, buffer k-d trees are limited by the amount of points that can fit on a single device. In this work, we show how to modify the original data structure and the associated workflow to make the overall approach capable of dealing with massive data sets. We further provide a simple yet efficient way of using multiple devices given in a single workstation. The applicability of the modified framework is demonstrated in the context of astronomy, a field that is faced with huge amounts of data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gieseke/bufferkdtree
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.