The JASMIN super-data-cluster
B. N. Lawrence, V. Bennett, J. Churchill, M. Juckes, P. Kershaw, P., Oliver, M. Pritchard, and A. Stephens

TL;DR
The JASMIN super-data-cluster is a large-scale infrastructure supporting climate and earth system data analysis, storage, and collaboration for UK, European, and international research communities.
Contribution
This paper describes the design, deployment, and capabilities of the JASMIN super-data-cluster, integrating storage, computation, and networking for climate and earth observation data.
Findings
Supports 9.3 PB of storage and 370+ cores
Provides reliable high-speed links to UK supercomputers
Enables collaboration across UK and European climate communities
Abstract
The JASMIN super-data-cluster is being deployed to support the data analysis requirements of the UK and European climate and earth system modelling community. Physical colocation of the core JASMIN resource with significant components of the facility for Climate and Environmental Monitoring from Space (CEMS) provides additional support for the earth observation community, as well as facilitating further comparison and evaluation of models with data. JASMIN and CEMS together centrally deploy 9.3 PB of storage - 4.6 PB of Panasas fast disk storage alongside the STFC Atlas Tape Store. Over 370 computing cores provide local computation. Remote JASMIN resources at Bristol, Leeds and Reading provide additional distributed storage and compute configured to support local workflow as a stepping stone to using the central JASMIN system. Fast network links from JASMIN provide reliable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Scientific Computing and Data Management · Advanced Data Storage Technologies
