Extracting Knowledge From Massive Astronomical Data Sets
Massimo Brescia, Stefano Cavuoti, S. G. Djorgovski, Ciro Donalek,, Giuseppe Longo, Maurizio Paolillo

TL;DR
This paper discusses the challenges of applying data mining techniques to massive astronomical datasets and introduces the DAME web application for knowledge discovery, demonstrating its use in galaxy and AGN classification.
Contribution
It presents the DAME web application tailored for data mining in astronomy and illustrates its application to galaxy and active galactic nuclei classification.
Findings
DAME effectively identifies candidate globular clusters.
DAME successfully classifies active galactic nuclei.
Tools like DAME are essential for data-intensive astronomy in the future.
Abstract
The exponential growth of astronomical data collected by both ground based and space borne instruments has fostered the growth of Astroinformatics: a new discipline laying at the intersection between astronomy, applied computer science, and information and computation (ICT) technologies. At the very heart of Astroinformatics is a complex set of methodologies usually called Data Mining (DM) or Knowledge Discovery in Data Bases (KDD). In the astronomical domain, DM/KDD are still in a very early usage stage, even though new methods and tools are being continuously deployed in order to cope with the Massive Data Sets (MDS) that can only grow in the future. In this paper, we briefly outline some general problems encountered when applying DM/KDD methods to astrophysical problems, and describe the DAME (DAta Mining & Exploration) web application. While specifically tailored to work on MDS,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
