High-Performance Statistical Computing (HPSC): Challenges, Opportunities, and Future Directions
Sameh Abdulah, Mary Lai O. Salvana, Ying Sun, David E. Keyes, and Marc G. Genton

TL;DR
This paper discusses the emergence of high-performance statistical computing (HPSC), highlighting challenges, opportunities, and a roadmap for integrating statistical methods with high-performance computing platforms to advance scientific research.
Contribution
It provides a historical overview, identifies key challenges, and proposes a vision and roadmap for developing a dedicated HPSC community bridging statistics and HPC.
Findings
HPSC is an emerging field with significant potential.
Bridging SC and HPC communities can accelerate statistical applications.
A roadmap for developing HPSC community is proposed.
Abstract
We recognize the emergence of a statistical computing community focused on working with large computing platforms and producing software and applications that exemplify high-performance statistical computing (HPSC). The statistical computing (SC) community develops software that is widely used across disciplines. However, it remains largely absent from the high-performance computing (HPC) landscape, particularly on platforms such as those featured on the Top500 or Green500 lists. Many disciplines already participate in HPC, mostly centered around simulation science, although data-focused efforts under the artificial intelligence (AI) label are gaining popularity. Bridging this gap requires both community adaptation and technical innovation to align statistical methods with modern HPC technologies. We can accelerate progress in fast and scalable statistical applications by building…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Data Analysis with R · Gaussian Processes and Bayesian Inference
