Two Point Correlation Function Estimation with Contaminated Data
Arya Farahi

TL;DR
This paper introduces a new estimator for the two-point correlation function that effectively corrects for contamination and incompleteness in survey data, improving accuracy and reducing bias in cosmological clustering measurements.
Contribution
The paper presents the prediction-powered Landy--Szalay (PP--LS) estimator, a novel method that combines noisy labels with a small spectroscopic subset to debias clustering estimates without complex modeling.
Findings
PP--LS removes bias caused by contamination in simulations.
PP--LS achieves lower variance than spectroscopic-only estimators.
The method is computationally efficient and compatible with existing pipelines.
Abstract
The two-point correlation function (2PCF) is a cornerstone of precision cosmology, yet its estimation from imaging surveys is vulnerable to contamination and incompleteness arising from imperfect target selection and pipeline-level inclusion decisions. In practice, the scientific target is a physically defined population, while the working catalog is constructed from noisy measurements and selection cuts, leading to mismatches between true and observed inclusion. These errors are often spatially structured, correlating with survey depth, observing conditions, and foregrounds, and can imprint spurious large-scale power or suppress the true clustering signal. High-resolution spectroscopic samples provide gold-standard inclusion in the target population but are typically available for only a small subset of objects. We introduce a prediction-powered Landy--Szalay (PP--LS) estimator that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGalaxies: Formation, Evolution, Phenomena · Geochemistry and Geologic Mapping · Gamma-ray bursts and supernovae
