Attention-based Neural Network Emulators for Multi-Probe Data Vectors Part III: Modeling The Next Generation Surveys
Yijie Zhu, Evan Saraivanov, Joshua A. Kable, Artemis Sofia Giannakopoulou, Amritpal Nijjar, Vivian Miranda, Marco Bonici, Tim Eifler, and Elisabeth Krause

TL;DR
This paper demonstrates that attention-based neural network emulators can accurately model CMB power spectra within the standard cosmological model, reducing outliers and enabling direct application to real data for upcoming cosmological surveys.
Contribution
It introduces attention mechanisms into neural emulators for CMB spectra, achieving higher precision and broader applicability compared to traditional architectures.
Findings
Attention-based models reduce outliers in emulating CMB spectra.
Emulators achieve less than 10% outliers with 200,000-400,000 training vectors.
Models meet precision requirements for current and future CMB experiments.
Abstract
Machine learning can accelerate cosmological inferences that involve many sequential evaluations of computationally expensive data vectors. Previous works in this series have examined how machine learning architectures impact emulator accuracy and training time for optical shear and galaxy clustering 2-point function. In this final manuscript, we explore neural network performance when emulating Cosmic Microwave Background temperature and polarization power spectra. We maximize the volume of applicability in the parameter space of our emulators within the standard -cold-dark-matter model while ensuring that errors are below cosmic variance. Relative to standard multi-layer perceptron architectures, we find the dot-product-attention mechanism reduces the number of outliers among testing cosmologies, defined as the fraction of testing points with relative to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
