Participatory Science and Machine Learning Applied to Millions of Sources in the Hobby-Eberly Telescope Dark Energy Experiment
Lindsay R. House, Karl Gebhardt, Keely Finkelstein, Erin Mentuch, Cooper, Dustin Davis, Daniel J. Farrow, Donald P. Schneider

TL;DR
This paper combines participatory citizen science with machine learning to efficiently classify millions of sources in the HETDEX survey, significantly reducing false positives and enabling analysis of large, low signal-to-noise datasets.
Contribution
It introduces a scalable approach that leverages citizen science data to train machine learning models for large-scale astronomical source classification.
Findings
Over 6 million classifications from volunteers
Achieved 94% confidence in false positive identification
Expanded visually-vetted sample from 14,000 to 190,000 sources
Abstract
We are merging a large participatory science effort with machine learning to enhance the Hobby-Eberly Telescope Dark Energy Experiment (HETDEX). Our overall goal is to remove false positives, allowing us to use lower signal-to-noise data and sources with low goodness-of-fit. With six million classifications through Dark Energy Explorers, we can confidently determine if a source is not real at over 94% confidence level when classified by at least ten individuals; this confidence level increases for higher signal-to-noise sources. To date, we have only been able to apply this direct analysis to 190,000 sources. The full sample of HETDEX will contain around 2-3M sources, including nearby galaxies ([O II] emitters), distant galaxies (Lyman-alpha emitters or LAEs), false positives, and contamination from instrument issues. We can accommodate this tenfold increase by using machine learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
