CLiMB: A Domain-Informed Novelty Detection Clustering Framework for Galactic Archaeology and Scientific Discovery
Lorenzo Monti, Tatiana Muraveva, Brian Sheridan, Davide Massari, Alessia Garofalo, Gisella Clementini, Umberto Michelucci

TL;DR
CLiMB is a novel clustering framework that effectively combines prior domain knowledge with density-based methods to detect known and unknown structures in astronomical data, enhancing scientific discovery.
Contribution
This paper introduces CLiMB, a two-phase, domain-informed clustering framework that decouples known cluster anchoring from novelty exploration, improving anomaly detection in galactic archaeology.
Findings
Achieves high accuracy in recovering known Milky Way substructures (ARI=0.829).
Outperforms baseline methods with ARI below 0.20.
Successfully identifies new dynamical features in unlabelled data.
Abstract
In data-driven scientific discovery, a challenge lies in classifying well-characterized phenomena while identifying novel anomalies. Current semi-supervised clustering algorithms do not always fully address this duality, often assuming that supervisory signals are globally representative. Consequently, methods often enforce rigid constraints that suppress unanticipated patterns or require a pre-specified number of clusters, rendering them ineffective for genuine novelty detection. To bridge this gap, we introduce CLiMB (CLustering in Multiphase Boundaries), a domain-informed framework decoupling the exploitation of prior knowledge from the exploration of unknown structures. Using a sequential two-phase approach, CLiMB first anchors known clusters using metric-adaptive constrained partitioning, and subsequently applies density-based clustering to residual data to reveal arbitrary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStellar, planetary, and galactic studies · Astronomy and Astrophysical Research · Astrophysics and Star Formation Studies
