Optimizing exoplanet atmosphere retrieval using unsupervised machine-learning classification
Joshua J.C. Hayes, E. Kerins, S. Awiphan, I. McDonald, J.S. Morgan, P., Chuanraksasat, S. Komonjinda, N. Sanguansak, and P. Kittara (SPEARNET)

TL;DR
This paper introduces an unsupervised machine learning method using PCA and k-means clustering to generate informed priors for exoplanet atmosphere retrieval, significantly speeding up the process while maintaining accuracy.
Contribution
The novel approach combines PCA and clustering to create informed priors, improving retrieval speed and efficiency over traditional uniform priors in exoplanet atmosphere analysis.
Findings
Classifier accurately assigns unseen spectra to correct classes at R=30-300
Informed priors reduce retrieval time by up to 50%
Method is applicable to real-world and future observational data
Abstract
One of the principal bottlenecks to atmosphere characterisation in the era of all-sky surveys is the availability of fast, autonomous and robust atmospheric retrieval methods. We present a new approach using unsupervised machine learning to generate informed priors for retrieval of exoplanetary atmosphere parameters from transmission spectra. We use principal component analysis (PCA) to efficiently compress the information content of a library of transmission spectra forward models generated using the PLATON package. We then apply a -means clustering algorithm in PCA space to segregate the library into discrete classes. We show that our classifier is almost always able to instantaneously place a previously unseen spectrum into the correct class, for low-to-moderate spectral resolutions, , in the range and noise levels up to ~per~cent of the peak-to-trough spectrum…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
