TransformerPayne: enhancing spectral emulation accuracy and data efficiency by capturing long-range correlations
Tomasz R\'o\.za\'nski, Yuan-Sen Ting, and Maja Jab{\l}o\'nska

TL;DR
TransformerPayne leverages Transformer models to significantly improve spectral emulation accuracy, data efficiency, and interpretability over traditional neural network approaches, especially in capturing long-range spectral correlations.
Contribution
This work introduces TransformerPayne, a novel spectral emulator that outperforms existing models by capturing long-range spectral features and enabling effective transfer learning.
Findings
TransformerPayne achieves ~0.15% MAE on full spectral grids.
It outperforms scaled-up The Payne by 2-5 times in accuracy.
Fine-tuning reduces training data requirements by up to tenfold.
Abstract
Stellar spectra emulators often rely on large grids and tend to reach a plateau in emulation accuracy, leading to significant systematic errors when inferring stellar properties. Our study explores the use of Transformer models to capture long-range information in spectra, comparing their performance to The Payne emulator (a fully connected multilayer perceptron), an expanded version of The Payne, and a convolutional-based emulator. We tested these models on synthetic spectra grids, evaluating their performance by analyzing emulation residuals and assessing the quality of spectral parameter inference. The newly introduced TransformerPayne emulator outperformed all other tested models, achieving a mean absolute error (MAE) of approximately 0.15% when trained on the full grid. The most significant improvements were observed in grids containing between 1000 and 10,000 spectra, with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsThin-Film Transistor Technologies · Magneto-Optical Properties and Applications
