CLARA: A Modular Framework for Unsupervised Transit Detection Using TESS Light Curves
Mainak Dasgupta

TL;DR
CLARA introduces a modular, unsupervised framework using Random Forests for transit detection in TESS light curves, demonstrating improved detection rates and interpretability across multiple sectors.
Contribution
This work presents a novel unsupervised transit detection framework that leverages synthetic training sets and morphological analysis, enhancing generalization and interpretability.
Findings
Achieved a 14.04% detection rate with optimized clustering.
Enriched transit candidate detection over baseline rates.
Processed over 87,000 TESS light curves successfully.
Abstract
We present CLARA, a modular framework for unsupervised transit detection in TESS light curves, leveraging Unsupervised Random Forests (URFs) trained on synthetic datasets and guided by morphological similarity analysis. This work addresses two core questions: (a) How does the design of synthetic training sets affect the performance and generalization of URFs across independent TESS sectors? (b) Do URF anomaly scores correlate with genuine astrophysical phenomena, enabling effective identification and clustering of transit-like signals? We investigate these questions through a two-part study focused on (1) detection performance optimization, and (2) the physical interpretability of anomalies. In Part I, we introduce three URF model variants tuned via alpha-controlled scoring objectives, and evaluate their generalization across five TESS sectors. This large-scale test involved scoring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
