Efficient computation of the joint sample frequency spectra for multiple   populations

John A. Kamm; Jonathan Terhorst; Yun S. Song

arXiv:1503.01133·math.PR·March 3, 2016

Efficient computation of the joint sample frequency spectra for multiple populations

John A. Kamm, Jonathan Terhorst, Yun S. Song

PDF

TL;DR

This paper introduces new formulas and algorithms for efficiently computing the joint sample frequency spectrum across multiple populations with complex demographic histories, improving stability and scalability.

Contribution

It provides novel analytic formulas and algorithms for the expected joint SFS, enabling efficient inference in complex multi-population models.

Findings

01

Enhanced numerical stability in computations

02

Reduced computational complexity for large samples

03

Successful application to empirical data with many populations

Abstract

A wide range of studies in population genetics have employed the sample frequency spectrum (SFS), a summary statistic which describes the distribution of mutant alleles at a polymorphic site in a sample of DNA sequences. In particular, recently there has been growing interest in analyzing the joint SFS data from multiple populations to infer parameters of complex demographic histories, including variable population sizes, population split times, migration rates, admixture proportions, and so on. Although much methodological progress has been made, existing SFS-based inference methods suffer from numerical instability and high computational complexity when multiple populations are involved and the sample size is large. In this paper, we present new analytic formulas and algorithms that enable efficient computation of the expected joint SFS for multiple populations related by a complex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.