On Correlation Detection and Alignment Recovery of Gaussian Databases
Ran Tamir

TL;DR
This paper introduces a novel two-stage algorithm for detecting correlation and recovering partial or full alignment between Gaussian databases, with improved error bounds and computational efficiency over existing methods.
Contribution
It presents a new correlation detection method with graph-theoretic bounds and multiple algorithms for partial and full alignment recovery, enhancing accuracy and efficiency.
Findings
The proposed detector outperforms recent detectors in specific parameter regimes.
Developed a graph-theoretic technique for bounding moments of dependent indicator sums.
Algorithms achieve reliable alignment recovery with reduced computational complexity.
Abstract
In this work, we propose an efficient two-stage algorithm solving a joint problem of correlation detection and partial alignment recovery between two Gaussian databases. Correlation detection is a hypothesis testing problem; under the null hypothesis, the databases are independent, and under the alternate hypothesis, they are correlated, under an unknown row permutation. We develop bounds on the type-I and type-II error probabilities, and show that the analyzed detector performs better than a recently proposed detector, at least for some specific parameter choices. Since the proposed detector relies on a statistic, which is a sum of dependent indicator random variables, then in order to bound the type-I probability of error, we develop a novel graph-theoretic technique for bounding the -th order moments of such statistics. When the databases are accepted as correlated, the algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Bayesian Modeling and Causal Inference · Geochemistry and Geologic Mapping
