Systematic biases when using deep neural networks for annotating large catalogs of astronomical images
Sanchari Dhar, Lior Shamir

TL;DR
Deep neural networks used for astronomical image annotation can introduce biases related to sky location, affecting galaxy classification and potentially causing cosmological-scale anisotropies in catalogs.
Contribution
This study reveals that training data context influences DCNN biases in galaxy morphology classification, highlighting the need for careful training set construction.
Findings
Sky location affects DCNN classification bias.
Bias leads to detectable cosmological anisotropy.
Training set considerations are crucial for unbiased results.
Abstract
Deep convolutional neural networks (DCNNs) have become the most common solution for automatic image annotation due to their non-parametric nature, good performance, and their accessibility through libraries such as TensorFlow. Among other fields, DCNNs are also a common approach to the annotation of large astronomical image databases acquired by digital sky surveys. One of the main downsides of DCNNs is the complex non-intuitive rules that make DCNNs act as a ``black box", providing annotations in a manner that is unclear to the user. Therefore, the user is often not able to know what information is used by the DCNNs for the classification. Here we demonstrate that the training of a DCNN is sensitive to the context of the training data such as the location of the objects in the sky. We show that for basic classification of elliptical and spiral galaxies, the sky location of the galaxies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing in Agriculture
MethodsDiffusion-Convolutional Neural Networks
