Pushing the Limit: A Hybrid Parallel Implementation of the Multi-resolution Approximation for Massive Data
Huang Huang, Lewis R. Blake, Dorit M. Hammerling

TL;DR
This paper presents a hybrid parallel C++ implementation of the multi-resolution approximation method for Gaussian processes, enabling efficient likelihood-based inference on massive spatial datasets with up to 47 million observations.
Contribution
The paper introduces a hybrid parallel implementation combining MPI and OpenMP for MRA, significantly improving scalability and inference speed for large spatial data sets.
Findings
Parallel implementation achieves faster inference on large datasets.
C++ code outperforms MATLAB version on small datasets.
Practical for real-world massive spatial data analysis.
Abstract
The multi-resolution approximation (MRA) of Gaussian processes was recently proposed to conduct likelihood-based inference for massive spatial data sets. An advantage of the methodology is that it can be parallelized. We implemented the MRA in C++ for both serial and parallel versions. In the parallel implementation, we use a hybrid parallelism that employs both distributed and shared memory computing for communications between and within nodes by using the Message Passing Interface (MPI) and OpenMP, respectively. The performance of the serial code is compared between the C++ and MATLAB implementations over a small data set on a personal laptop. The C++ parallel program is further carefully studied under different configurations by applications to data sets from around a tenth of a million to 47 million observations. We show the practicality of this implementation by demonstrating that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoil Geostatistics and Mapping · Gaussian Processes and Bayesian Inference · Geochemistry and Geologic Mapping
