Self-Supervised Representation Learning for Astronomical Images

Md Abul Hayat; George Stein; Peter Harrington; Zarija Luki\'c; Mustafa; Mustafa

arXiv:2012.13083·astro-ph.IM·June 30, 2022

Self-Supervised Representation Learning for Astronomical Images

Md Abul Hayat, George Stein, Peter Harrington, Zarija Luki\'c, Mustafa, Mustafa

PDF

1 Repo

TL;DR

This paper demonstrates that self-supervised learning on astronomical sky survey images can produce representations that outperform supervised methods in galaxy classification and redshift estimation, with fewer labeled data.

Contribution

The authors introduce a contrastive self-supervised learning framework for astronomical images that achieves superior performance with less labeled data compared to traditional supervised approaches.

Findings

01

Self-supervised representations outperform supervised models in galaxy morphology classification.

02

The approach achieves comparable accuracy to supervised models using 2-4 times fewer labels.

03

The method is effective for multiple scientific tasks, including redshift estimation.

Abstract

Sky surveys are the largest data generators in astronomy, making automated tools for extracting meaningful scientific information an absolute necessity. We show that, without the need for labels, self-supervised learning recovers representations of sky survey images that are semantically useful for a variety of scientific tasks. These representations can be directly used as features, or fine-tuned, to outperform supervised methods trained only on labeled data. We apply a contrastive learning framework on multi-band galaxy photometry from the Sloan Digital Sky Survey (SDSS) to learn image representations. We then use them for galaxy morphology classification, and fine-tune them for photometric redshift estimation, using labels from the Galaxy Zoo 2 dataset and SDSS spectroscopy. In both downstream tasks, using the same learned representations, we outperform the supervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

georgestein/galaxy_search
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsContrastive Learning