$\texttt{Exoformer}$: Accelerating Bayesian atmospheric retrievals with transformer neural networks
L. Pagliaro, T. Zingales, G. Piotto, I. Giovannini, and G. Mantovan

TL;DR
This paper introduces Exoformer, a transformer neural network that accelerates Bayesian atmospheric retrievals for exoplanets by generating informative priors, significantly reducing computation time while maintaining accuracy.
Contribution
Exoformer is a novel transformer-based model that provides informative priors to speed up Bayesian retrievals of exoplanet atmospheres, demonstrated on JWST data with preserved accuracy.
Findings
Retrievals accelerated by a factor of 3-8 using Exoformer-derived priors.
Retrieved parameters and spectra remain consistent with classical methods.
Bayesian evidence comparison shows no strong preference between approaches.
Abstract
Computationally expensive and time-consuming Bayesian atmospheric retrievals pose a significant bottleneck for the rapid analysis of high-quality exoplanetary spectra from present and next generation space telescopes, such as JWST and Ariel. As these missions demand more complex atmospheric models to fully characterize the spectral features they uncover, they will benefit from data-driven analysis techniques such as machine and deep learning. We introduce and detail a novel approach that uses a transformer-based neural network () to rapidly generate informative prior distributions for atmospheric transmission spectra of hot Jupiters. We demonstrate the effectiveness of using both simulated observations and real JWST data of WASP-39b and WASP-17b within the TauREx retrieval framework, leveraging the nested sampling algorithm. By replacing standard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
