A Novel, Fully Automated Pipeline for Period Estimation in the EROS 2 Data Set
Pavlos Protopapas, Pablo Huijse, Pablo A. Estevez, Pablo, Zegers, Jose C. Principe

TL;DR
This paper introduces a fully automated pipeline that accurately discriminates periodic from non-periodic lightcurves in large datasets, using a novel kernel and similarity measure, achieving high precision and efficiency on synthetic and real data.
Contribution
The paper presents a new method with a periodic kernel and information-theoretic similarity measure for period estimation, optimized for large-scale lightcurve analysis.
Findings
Achieved 90% completeness and 95% precision on synthetic data.
Analyzed 32.8 million lightcurves in 18 hours using GPU clusters.
Estimated periodic source fractions in LMC and SMC.
Abstract
We present a new method to discriminate periodic from non-periodic irregularly sampled lightcurves. We introduce a periodic kernel and maximize a similarity measure derived from information theory to estimate the periods and a discriminator factor. We tested the method on a dataset containing 100,000 synthetic periodic and non-periodic lightcurves with various periods, amplitudes and shapes generated using a multivariate generative model. We correctly identified periodic and non-periodic lightcurves with a completeness of 90% and a precision of 95%, for lightcurves with a signal-to-noise ratio (SNR) larger than 0.5. We characterize the efficiency and reliability of the model using these synthetic lightcurves and applied the method on the EROS-2 dataset. A crucial consideration is the speed at which the method can be executed. Using hierarchical search and some simplification on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
