LenghuSky-8: An 8-Year All-Sky Cloud Dataset with Star-Aware Masks and Alt-Az Calibration for Segmentation and Nowcasting
Yicheng Rui, Xiao-Wei Duan, Licai Deng, Fan Yang, Zhengming Dang, Zhengjun Du, Junhao Peng, Wenhao Chu, Umut Mahmut, Kexin Li, Yiyun Wu, Fabo Feng

TL;DR
LenghuSky-8 is a comprehensive eight-year all-sky cloud dataset with star-aware masks and precise calibration, enabling improved segmentation and nowcasting for astronomical site monitoring.
Contribution
The paper introduces LenghuSky-8, a large-scale, multi-year all-sky dataset with star-aware masks and altitude-azimuth calibration, and benchmarks cloud segmentation and nowcasting methods.
Findings
Achieved 93.3% overall accuracy in cloud segmentation.
Calibrated pixel mapping with ~0.37° zenith and ~1.34° at 30° altitude.
ConvLSTM outperforms other models but shows limited near-term cloud prediction gains.
Abstract
Ground-based time-domain observatories require minute-by-minute, site-scale awareness of cloud cover, yet existing all-sky datasets are short, daylight-biased, or lack astrometric calibration. We present LenghuSky-8, an eight-year (2018-2025) all-sky imaging dataset from a premier astronomical site, comprising 429,620 frames with 81.2% night-time coverage, star-aware cloud masks, background masks, and per-pixel altitude-azimuth (Alt-Az) calibration. For robust cloud segmentation across day, night, and lunar phases, we train a linear probe on DINOv3 local features and obtain 93.3% 1.1% overall accuracy on a balanced, manually labeled set of 1,111 images. Using stellar astrometry, we map each pixel to local alt-az coordinates and measure calibration uncertainties of approximately 0.37 deg at zenith and approximately 1.34 deg at 30 deg altitude, sufficient for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGamma-ray bursts and supernovae · Impact of Light on Environment and Health · Remote Sensing in Agriculture
