A new model for archiving synoptic data in the VISTA Data Flow System
Nicholas Cross, Ross Collins, Nigel Hambly, Mike Read, Eckhard, Sutorius

TL;DR
This paper presents a novel, flexible, and scalable data archiving model for synoptic astronomical data in the VISTA Data Flow System, enhancing data queryability, quality control, and automation for large multi-epoch datasets.
Contribution
The authors introduce a new database schema and curation procedures tailored for deep and synoptic datasets, improving data management and analysis capabilities.
Findings
Enhanced data queryability through new database tables.
Improved quality control with automated curation procedures.
Scalable design suitable for future large-scale surveys.
Abstract
The VISTA Data Flow System comprises nightly pipeline and archiving of near infrared data from UKIRT-WFCAM and VISTA. This includes multi-epoch data which can be used to find moving and variable objects. We have developed a new model for archiving these data which gives the user an extremely flexible and reliable data set that is easy to query through an SQL interface. We have introduced several new database tables into our schema for deep/synoptic datasets. We have also developed a set of curation procedures, which give additional quality control and automation. We discuss the methods used and show some example data. Our design is particularly effective on correlated data-sets, where the observations in different filters are synchronised. It is scalable to large VISTA datasets which will be observed in the next few years and to future surveys such as Pan-STARRS and LSST.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Advanced Data Storage Technologies · Environmental Monitoring and Data Management
