Exploring Dimensionality Reduction of SDSS Spectral Abundances
Qianyu Fan

TL;DR
This paper compares five dimensionality reduction techniques on SDSS stellar spectra to identify the most effective methods for preserving chemical abundance information, highlighting the advantages of non-linear approaches like Autoencoders.
Contribution
It systematically evaluates and ranks multiple dimensionality reduction methods on SDSS spectral data, emphasizing the benefits of non-linear techniques over linear ones.
Findings
Autoencoder outperforms PCA in preserving information.
Non-linear methods show ~10% improvement over linear methods.
Performance ranking: PCA < UMAP < t-SNE < VAE < Autoencoder.
Abstract
High-resolution stellar spectra offer valuable insights into atmospheric parameters and chemical compositions. However, their inherent complexity and high-dimensionality present challenges in fully utilizing the information they contain. In this study, we utilize data from the Apache Point Observatory Galactic Evolution Experiment (APOGEE) within the Sloan Digital Sky Survey IV (SDSS-IV) to explore latent representations of chemical abundances by applying five dimensionality reduction techniques: PCA, t-SNE, UMAP, Autoencoder, and VAE. Through this exploration, we evaluate the preservation of information and compare reconstructed outputs with the original 19 chemical abundance data. Our findings reveal a performance ranking of PCA < UMAP < t-SNE < VAE < Autoencoder, through comparing their explained variance under optimized MSE. The performance of non-linear (Autoencoder and VAE)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAstronomy and Astrophysical Research · Gamma-ray bursts and supernovae · Astronomical Observations and Instrumentation
