TL;DR
This paper introduces new computational methods and software, LDpop, for accurately estimating two-locus sampling probabilities and recombination rates in populations with variable sizes, improving over previous constant-size assumptions.
Contribution
The authors develop a novel formula and software for computing two-locus sampling probabilities under piecewise constant population sizes, enhancing demographic modeling accuracy.
Findings
Accounting for population size changes improves recombination rate estimates.
The new methods handle moderate sample sizes and complex demographic histories.
LDpop provides accurate and scalable tools for population genetics analysis.
Abstract
Two-locus sampling probabilities have played a central role in devising an efficient composite likelihood method for estimating fine-scale recombination rates. Due to mathematical and computational challenges, these sampling probabilities are typically computed under the unrealistic assumption of a constant population size, and simulation studies have shown that resulting recombination rate estimates can be severely biased in certain cases of historical population size changes. To alleviate this problem, we develop here new methods to compute the sampling probability for variable population size functions that are piecewise constant. Our main theoretical result, implemented in a new software package called LDpop, is a novel formula for the sampling probability that can be evaluated by numerically exponentiating a large but sparse matrix. This formula can handle moderate sample sizes ($n…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
