Estimating Exoplanet Mass using Machine Learning on Incomplete Datasets
Florian Lalande, Elizabeth Tasker, Kenji Doya

TL;DR
This paper develops and compares machine learning methods, especially a new $k$NN×KDE algorithm, to estimate exoplanet masses from incomplete datasets, enabling broader and more confident predictions of planetary properties.
Contribution
Introduces a novel $k$NN×KDE algorithm for imputing exoplanet properties, capable of providing probability distributions and handling incomplete data effectively.
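As a rough illustration of the idea behind combining nearest neighbours with kernel density estimation, the sketch below imputes one missing value as a probability density rather than a single number. It is not the authors' implementation: the function name `knn_kde_impute`, the toy array, the masked Euclidean distance, the Gaussian kernel, and the fixed `bandwidth` are all assumptions made for this example.

```python
import numpy as np

def knn_kde_impute(X, row, target, k=5, bandwidth=0.1, grid_size=200):
    """Hypothetical sketch: density estimate for the missing X[row, target].

    Neighbours are found using only features observed in both the query
    row and the candidate row; a Gaussian KDE is then built on the
    neighbours' target values.
    """
    X = np.asarray(X, dtype=float)
    query = X[row]
    obs = ~np.isnan(query)        # features observed for the query row
    obs[target] = False           # never use the target column as a feature

    # Candidate donors: every other row whose target value is observed.
    donors = np.where(~np.isnan(X[:, target]))[0]
    donors = donors[donors != row]

    # Root-mean-square distance over the features both rows have observed.
    diff = X[donors][:, obs] - query[obs]
    shared = ~np.isnan(diff)
    n_shared = shared.sum(axis=1)
    sq = np.where(shared, diff ** 2, 0.0).sum(axis=1)
    dist = np.sqrt(sq / np.maximum(n_shared, 1))
    dist[n_shared == 0] = np.inf  # no common features: cannot compare

    nearest = donors[np.argsort(dist)[:k]]
    values = X[nearest, target]

    # Gaussian KDE evaluated on a grid around the neighbours' values.
    grid = np.linspace(values.min() - 3 * bandwidth,
                       values.max() + 3 * bandwidth, grid_size)
    z = (grid[:, None] - values[None, :]) / bandwidth
    density = np.exp(-0.5 * z ** 2).sum(axis=1)
    density /= len(values) * bandwidth * np.sqrt(2 * np.pi)
    return grid, density

# Toy data (made up): three features, the last one plays the role of mass.
toy = np.array([
    [1.0,  1.0,    1.0],
    [1.1,  0.9,    1.2],
    [5.0,  5.0,    5.0],
    [5.2,  np.nan, 5.5],
    [1.05, 1.0,    np.nan],   # row whose last feature is imputed
])
grid, density = knn_kde_impute(toy, row=4, target=2, k=2)
```

The width of the returned density gives a simple measure of confidence: tightly clustered neighbour values produce a narrow peak, while disagreeing neighbours produce a broad or multimodal curve.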
Findings
Imputation improves with more data, even if incomplete.
The $k$NN×KDE algorithm provides confidence measures for predictions.
Synthetic planet populations can be generated to explore planetary categories.
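One way to see how a predicted distribution could feed a synthetic population is to draw samples from it. The sketch below draws from a Gaussian kernel density estimate by picking a neighbour value at random and adding Gaussian noise. The function name `sample_gaussian_kde`, the example values, and the bandwidth are hypothetical and are not taken from the paper.

```python
import numpy as np

def sample_gaussian_kde(values, bandwidth, n_samples, rng):
    """Hypothetical sketch: draw samples from a Gaussian KDE.

    A Gaussian KDE is an equal-weight mixture of Gaussians centred on
    the data points, so sampling means choosing a centre uniformly at
    random and perturbing it with N(0, bandwidth^2) noise.
    """
    values = np.asarray(values, dtype=float)
    centers = rng.choice(values, size=n_samples, replace=True)
    return centers + rng.normal(0.0, bandwidth, size=n_samples)

rng = np.random.default_rng(0)
neighbour_masses = [1.0, 1.2, 0.9]   # made-up neighbour values
draws = sample_gaussian_kde(neighbour_masses, bandwidth=0.1,
                            n_samples=1000, rng=rng)
```

Repeating such draws for every missing entry would yield one synthetic realisation of the dataset. Whether this matches the procedure used in the paper is not stated in this summary.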
Abstract
The exoplanet archive is an incredible resource of information on the properties of discovered extrasolar planets, but statistical analysis has been limited by the number of missing values. One of the most informative bulk properties is planet mass, which is particularly challenging to measure: more than 70% of discovered planets have no measured value. We compare the capabilities of five different machine learning algorithms that can use multidimensional incomplete datasets to impute planet mass. We compare results obtained from a subset of the archive with a complete set of six planet properties against results obtained when all planet discoveries are leveraged in incomplete sets of six and eight planet properties. We find that imputation results improve with more data, even when the additional data is incomplete, and that this allows a mass prediction for…
Taxonomy
Topics: Astronomy and Astrophysical Research · Astronomical Observations and Instrumentation
Methods: Sparse Evolutionary Training
