Euclid Quick Data Release (Q1). From simulations to sky: Advancing machine-learning lens detection with real Euclid data
Euclid Collaboration: N. E. P. Lines (1), T. E. Collett (1), P. Holloway (1), K. Rojas (2), S. Schuldt (3, 4), R. B. Metcalf (5, 6), T. Li (1), A. Verma (7), G. Despali (5, 6, 8), F. Courbin (9, 10, 11), R. Gavazzi (12, 13), C. Tortora (14), B. Cl\'ement (15, 16)

TL;DR
This paper evaluates the performance gap of machine learning lens detection models trained on simulations versus real Euclid data, and proposes a hybrid training approach to improve real-world detection efficiency.
Contribution
It demonstrates that combining simulated and real Euclid lenses in training significantly enhances lens detection performance on actual survey data.
Findings
Simulated-only training achieves 92% recall but low precision on real data.
Including real lenses in training improves completeness by 25-30%.
Hybrid training strategies maximize lens discovery potential in Euclid data.
Abstract
In the era of large-scale surveys like Euclid, machine learning has become an essential tool for identifying rare yet scientifically valuable objects, such as strong gravitational lenses. However, supervised machine-learning approaches require large quantities of labelled examples to train on, and the limited number of known strong lenses has lead to a reliance on simulations for training. A well-known challenge is that machine-learning models trained on one data domain often underperform when applied to a different domain: in the context of lens finding, this means that strong performance on simulated lenses does not necessarily translate into equally good performance on real observations. In Euclid's Quick Data Release 1 (Q1), covering 63 deg2, 500 strong lens candidates were discovered through a synergy of machine learning, citizen science, and expert visual inspection. These…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Domain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI)
