Exploring Novel Data Storage Approaches for Large-Scale Numerical Weather Prediction
Nicolau Manubens Gil

TL;DR
This paper evaluates the performance of DAOS and Ceph object storage systems for large-scale Numerical Weather Prediction, demonstrating DAOS's superior scalability and flexibility compared to traditional POSIX file systems.
Contribution
The study develops new software adapters for ECMWF's NWP to utilize DAOS and Ceph, and provides comprehensive benchmarking comparing these systems to Lustre at scale.
Findings
DAOS outperforms Ceph and Lustre in scalability and flexibility.
Both DAOS and Ceph show excellent performance for NWP workloads.
Object storage systems like DAOS could see increased adoption in HPC environments.
Abstract
Driven by scientific and industry ambition, HPC and AI applications such as operational Numerical Weather Prediction (NWP) require processing and storing ever-increasing data volumes as fast as possible. Whilst POSIX distributed file systems and NVMe SSDs are currently a common HPC storage configuration providing I/O to applications, new storage solutions have proliferated or gained traction over the last decade with potential to address performance limitations POSIX file systems manifest at scale for certain I/O workloads. This work has primarily aimed to assess the suitability and performance of two object storage systems -namely DAOS and Ceph- for the ECMWF's operational NWP as well as for HPC and AI applications in general. New software-level adapters have been developed which enable the ECMWF's NWP to leverage these systems, and extensive I/O benchmarking has been conducted on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Cloud Computing and Resource Management · Parallel Computing and Optimization Techniques
