Low-cost high performance distributed data storage for multi-channel observations
Ying-bo Liu, Feng Wang, Hui Deng, Kai-fan Ji, Wei Dai, Shou-lin Wei,, Bo Liang, Xiao-li Zhang

TL;DR
This paper presents a low-cost, high-performance distributed storage system for the NVST solar telescope, capable of handling multi-terabyte daily data with efficient access, balancing cost and performance.
Contribution
The study introduces the first distributed storage system tailored for real-time astronomical data, optimizing access performance through strategies based on concurrent access and file size.
Findings
The system effectively meets NVST's real-time data storage needs.
Performance is significantly influenced by concurrent read/write and file size.
Three storage strategies improve access performance under multi-host conditions.
Abstract
The New Vacuum Solar Telescope (NVST) is a 1-m solar telescope that aims to observe the fine structures in both the photosphere and the chromosphere of the Sun. The observational data acquired simultaneously from one channel for the chromosphere and two channels for the photosphere bring great challenges to the data storage of NVST. The multi-channel instruments of NVST, including scientific cameras and multi-band spectrometers, generate at least 3 terabytes data per day and require high access performance while storing massive short-exposure images. It is worth studying and implementing a storage system for NVST which would balance the data availability, access performance and the cost of development. In this paper, we build a distributed data storage system (DDSS) for NVST and then deeply evaluate the availability of real-time data storage on a distributed computing environment. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Advanced Data Storage Technologies · Parallel Computing and Optimization Techniques
