Active deep learning method for the discovery of objects of interest in large spectroscopic surveys
Petr \v{S}koda (1, 2), Ond\v{r}ej Podsztavek (2), Pavel Tvrd\'ik, (2) ((1) Astronomical Institute of the Czech Academy of Sciences, (2) Faculty, of Information Technology of the Czech Technical University in Prague)

TL;DR
This paper presents an active deep learning approach using convolutional neural networks to automatically identify emission-line objects in large spectroscopic surveys, significantly aiding the discovery of rare astronomical objects.
Contribution
It introduces a novel active learning method with a custom 1D CNN for spectral classification, improving detection accuracy and discovering new emission-line objects in massive datasets.
Findings
Successfully identified emission-line stars with less than 6.5% error.
Discovered 948 new emission-line object candidates in LAMOST spectra.
Enhanced spectral classification accuracy with active learning and deep CNNs.
Abstract
Current archives of the LAMOST telescope contain millions of pipeline-processed spectra that have probably never been seen by human eyes. Most of the rare objects with interesting physical properties, however, can only be identified by visual analysis of their characteristic spectral features. A proper combination of interactive visualisation with modern machine learning techniques opens new ways to discover such objects. We apply active learning classification supported by deep convolutional networks to automatically identify complex emission-line shapes in multi-million spectra archives. We used the pool-based uncertainty sampling active learning driven by a custom-designed deep convolutional neural network with 12 layers inspired by VGGNet, AlexNet, and ZFNet, but adapted for one-dimensional feature vectors. The unlabelled pool set is represented by 4.1 million spectra from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax · Max Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Local Contrast Normalization · Dropout · Dense Connections · ZFNet
