Exoplanet Validation with Machine Learning: 50 new validated Kepler planets
David J. Armstrong, Jevgenij Gamper, Theodoros Damoulas

TL;DR
This paper introduces a machine learning approach, specifically a Gaussian process classifier, for validating exoplanets, successfully confirming 50 new Kepler planets and offering a faster, automated alternative to traditional methods like vespa.
Contribution
The study demonstrates the effectiveness of machine learning, particularly GPC, for probabilistic exoplanet validation, providing a scalable method adaptable to large datasets like TESS.
Findings
Validated 50 new Kepler planets using ML methods
Achieved rapid validation of thousands of candidates
Identified discrepancies with traditional vespa validation
Abstract
Over 30% of the ~4000 known exoplanets to date have been discovered using 'validation', where the statistical likelihood of a transit arising from a false positive (FP), non-planetary scenario is calculated. For the large majority of these validated planets calculations were performed using the vespa algorithm (Morton et al. 2016). Regardless of the strengths and weaknesses of vespa, it is highly desirable for the catalogue of known planets not to be dependent on a single method. We demonstrate the use of machine learning algorithms, specifically a gaussian process classifier (GPC) reinforced by other models, to perform probabilistic planet validation incorporating prior probabilities for possible FP scenarios. The GPC can attain a mean log-loss per sample of 0.54 when separating confirmed planets from FPs in the Kepler threshold crossing event (TCE) catalogue. Our models can validate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGaussian Process
