Cross-matching Engine for Incremental Photometric Sky Survey

Ing. Ji\v{r}\'i N\'advorn\'ik

arXiv:1506.07208·cs.DB·June 25, 2015·1 cites

Cross-matching Engine for Incremental Photometric Sky Survey

Ing. Ji\v{r}\'i N\'advorn\'ik

PDF

Open Access

TL;DR

This paper presents a novel approach to generate light curves from existing astronomical data without pre-planned surveys, focusing on scalable clustering techniques for catalog creation.

Contribution

It introduces a scalable, performance-oriented clustering method for astronomical catalog generation that leverages relational and NoSQL databases, enabling data mining of existing survey data.

Findings

01

Evaluated multiple clustering solutions for performance and scalability.

02

Developed quality standards for catalog generation.

03

Compared relational databases with NoSQL and supercomputing approaches.

Abstract

For light curve generation, a pre-planned photometry survey is needed nowadays, where all of the exposure coordinates have to be given and don't change during the survey. This thesis shows it is not required and we can data-mine these light curves from astronomical data that was never meant for this purpose. With this approach, we can recycle all of the photometric surveys in the world and generate light curves of observed objects for them. This thesis is addressing mostly the catalog generation process, which is needed for creating the light curves. In practice, it focuses on one of the most important problems in astroinformatics which is clustering data volumes on Big Data scale where most of the traditional techniques stagger. We consider a wide variety of possible solutions from the view of performance, scalability, distributability, etc. We defined criteria for time and memory…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Management and Algorithms · Time Series Analysis and Forecasting · Data Visualization and Analytics