Attention-based Neural Network Emulators for Multi-Probe Data Vectors Part III: Modeling The Next Generation Surveys

Yijie Zhu; Evan Saraivanov; Joshua A. Kable; Artemis Sofia Giannakopoulou; Amritpal Nijjar; Vivian Miranda; Marco Bonici; Tim Eifler; and Elisabeth Krause

arXiv:2505.22574·astro-ph.CO·July 25, 2025

Attention-based Neural Network Emulators for Multi-Probe Data Vectors Part III: Modeling The Next Generation Surveys

Yijie Zhu, Evan Saraivanov, Joshua A. Kable, Artemis Sofia Giannakopoulou, Amritpal Nijjar, Vivian Miranda, Marco Bonici, Tim Eifler, and Elisabeth Krause

PDF

Open Access

TL;DR

This paper demonstrates that attention-based neural network emulators can accurately model CMB power spectra within the standard cosmological model, reducing outliers and enabling direct application to real data for upcoming cosmological surveys.

Contribution

It introduces attention mechanisms into neural emulators for CMB spectra, achieving higher precision and broader applicability compared to traditional architectures.

Findings

01

Attention-based models reduce outliers in emulating CMB spectra.

02

Emulators achieve less than 10% outliers with 200,000-400,000 training vectors.

03

Models meet precision requirements for current and future CMB experiments.

Abstract

Machine learning can accelerate cosmological inferences that involve many sequential evaluations of computationally expensive data vectors. Previous works in this series have examined how machine learning architectures impact emulator accuracy and training time for optical shear and galaxy clustering 2-point function. In this final manuscript, we explore neural network performance when emulating Cosmic Microwave Background temperature and polarization power spectra. We maximize the volume of applicability in the parameter space of our emulators within the standard $Λ$ -cold-dark-matter model while ensuring that errors are below cosmic variance. Relative to standard multi-layer perceptron architectures, we find the dot-product-attention mechanism reduces the number of outliers among testing cosmologies, defined as the fraction of testing points with $Δ χ^{2} > 0.2$ relative to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications