Combining human and machine learning for morphological analysis of galaxy images
Evan Kuminski, Joe George, John Wallin, Lior Shamir

TL;DR
This paper explores combining citizen science data with machine learning to improve the morphological analysis of galaxy images, addressing the challenge of processing large datasets efficiently.
Contribution
It demonstrates that citizen science data can effectively train machine learning models for galaxy morphology classification, emphasizing data quality's importance.
Findings
Machine learning performance improves with higher-quality citizen science data.
Using consensus among volunteers enhances training data reliability.
The proposed method's source code is publicly available.
Abstract
The increasing importance of digital sky surveys collecting many millions of galaxy images has reinforced the need for robust methods that can perform morphological analysis of large galaxy image databases. Citizen science initiatives such as Galaxy Zoo showed that large datasets of galaxy images can be analyzed effectively by non-scientist volunteers, but since databases generated by robotic telescopes grow much faster than the processing power of any group of citizen scientists, it is clear that computer analysis is required. Here we propose to use citizen science data for training machine learning systems, and show experimental results demonstrating that machine learning systems can be trained with citizen science data. Our findings show that the performance of machine learning depends on the quality of the data, which can be improved by using samples that have a high degree of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
