TL;DR
This paper introduces a parallel approximation method using Tile Low-Rank (TLR) techniques within the ExaGeoStat framework to efficiently perform maximum likelihood estimation for large-scale geostatistics, significantly reducing computational costs while maintaining accuracy.
Contribution
It extends the ExaGeoStat software to incorporate TLR approximation, enabling scalable, accurate large-scale geostatistical simulations on high-performance systems.
Findings
Achieved up to 13X speedup on shared-memory systems.
Achieved up to 5X speedup on distributed-memory systems.
Maintained prediction accuracy with large datasets (up to 2 million points).
Abstract
Maximum likelihood estimation is an important statistical technique for estimating missing data, for example in climate and environmental applications, which are usually large and feature data points that are irregularly spaced. In particular, the Gaussian log-likelihood function is the \emph{de facto} model, which operates on the resulting sizable dense covariance matrix. The advent of high performance systems with advanced computing power and memory capacity have enabled full simulations only for rather small dimensional climate problems, solved at the machine precision accuracy. The challenge for high dimensional problems lies in the computation requirements of the log-likelihood function, which necessitates storage and operations, where represents the number of given spatial locations. This prohibitive computational cost may be reduced by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
