Practical Galaxy Morphology Tools from Deep Supervised Representation   Learning

Mike Walmsley; Anna M. M. Scaife; Chris Lintott; Michelle Lochner,; Verlon Etsebeth; Tobias G\'eron; Hugh Dickinson; Lucy Fortson; Sandor Kruk,; Karen L. Masters; Kameswara Bharadwaj Mantha; Brooke D. Simmons

arXiv:2110.12735·astro-ph.GA·June 9, 2022

Practical Galaxy Morphology Tools from Deep Supervised Representation Learning

Mike Walmsley, Anna M. M. Scaife, Chris Lintott, Michelle Lochner,, Verlon Etsebeth, Tobias G\'eron, Hugh Dickinson, Lucy Fortson, Sandor Kruk,, Karen L. Masters, Kameswara Bharadwaj Mantha, Brooke D. Simmons

PDF

1 Repo

TL;DR

Deep supervised models trained on Galaxy Zoo data learn meaningful galaxy representations that enable accurate similarity searches, anomaly detection, and efficient adaptation to new tasks with minimal additional labels, challenging the need for large new datasets.

Contribution

The paper demonstrates that deep supervised models trained on existing galaxy data produce transferable representations useful for various practical tasks in astronomy.

Findings

01

Models outperform recent approaches in galaxy similarity search.

02

100% accuracy in identifying top anomalies judged by volunteers.

03

Fine-tuned models outperform ImageNet-based models and training from scratch in new tasks.

Abstract

Astronomers have typically set out to solve supervised machine learning problems by creating their own representations from scratch. We show that deep learning models trained to answer every Galaxy Zoo DECaLS question learn meaningful semantic representations of galaxies that are useful for new tasks on which the models were never trained. We exploit these representations to outperform several recent approaches at practical tasks crucial for investigating large galaxy samples. The first task is identifying galaxies of similar morphology to a query galaxy. Given a single galaxy assigned a free text tag by humans (e.g. "#diffuse"), we can find galaxies matching that tag for most tags. The second task is identifying the most interesting anomalies to a particular researcher. Our approach is 100% accurate at identifying the most interesting 100 anomalies (as judged by Galaxy Zoo 2…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mwalmsley/zoobot
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.