Systematic biases when using deep neural networks for annotating large   catalogs of astronomical images

Sanchari Dhar; Lior Shamir

arXiv:2201.03131·astro-ph.GA·January 11, 2022

Systematic biases when using deep neural networks for annotating large catalogs of astronomical images

Sanchari Dhar, Lior Shamir

PDF

Open Access

TL;DR

Deep neural networks used for astronomical image annotation can introduce biases related to sky location, affecting galaxy classification and potentially causing cosmological-scale anisotropies in catalogs.

Contribution

This study reveals that training data context influences DCNN biases in galaxy morphology classification, highlighting the need for careful training set construction.

Findings

01

Sky location affects DCNN classification bias.

02

Bias leads to detectable cosmological anisotropy.

03

Training set considerations are crucial for unbiased results.

Abstract

Deep convolutional neural networks (DCNNs) have become the most common solution for automatic image annotation due to their non-parametric nature, good performance, and their accessibility through libraries such as TensorFlow. Among other fields, DCNNs are also a common approach to the annotation of large astronomical image databases acquired by digital sky surveys. One of the main downsides of DCNNs is the complex non-intuitive rules that make DCNNs act as a ``black box", providing annotations in a manner that is unclear to the user. Therefore, the user is often not able to know what information is used by the DCNNs for the classification. Here we demonstrate that the training of a DCNN is sensitive to the context of the training data such as the location of the objects in the sky. We show that for basic classification of elliptical and spiral galaxies, the sky location of the galaxies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRemote Sensing in Agriculture

MethodsDiffusion-Convolutional Neural Networks