A Wavelength-Aware Unsupervised Learning Approach for Large, Multicolor, Photometric Surveys
Bradley D. Hutchinson, Catherine A. Pilachowski, Christian I. Johnson

TL;DR
This paper presents a novel wavelength-aware unsupervised learning method using LSTM autoencoders to analyze large multicolor photometric datasets, effectively reconstructing stellar spectral energy distributions and identifying rare stellar types.
Contribution
Introduces a wavelength-aware LSTM autoencoder that encodes multiband photometry into a latent space, enabling improved data reconstruction and rare object detection in large astronomical surveys.
Findings
99.51% of stars have their SEDs reconstructed within 0.05 mag
Model likely denoises photometric data, enhancing measurement quality
Rare stellar types can be detected through poor reconstruction signals
Abstract
Observational astronomy has undergone a significant transformation driven by large-scale surveys, such as the Panoramic Survey Telescope and Rapid Response System (Pan-STARRS) Survey, the Sloan Digital Sky Survey (SDSS), and the Gaia Mission. These programs yield large, complex datasets that pose significant challenges for conventional analysis methods, and as a result, many different machine learning techniques are being tested and deployed. We introduce a new approach to analyzing multiband photometry by using a long-short term memory autoencoder (LSTM-AE). This model provides input-dependent reweighting across passbands on a star-by-star basis, enabling it to encode patterns present in the stars' spectral energy distributions (SEDs) into a two-dimensional latent space. We showcase this by using Pan-STARRS grizy mean magnitudes, and we use globular clusters, labels from SIMBAD, Gaia…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
