How to infer relative fitness from a sample of genomic sequences
Adel Dayarian, Boris I Shraiman

TL;DR
This paper introduces a method to infer the relative fitness of individuals in a population from genealogical tree shapes, using simulations to demonstrate its accuracy and potential applications in biology and medicine.
Contribution
It presents a novel heuristic algorithm that predicts individual fitness from genealogical tree shapes, exploiting information not fully utilized in previous methods.
Findings
Inferred fitness correlates strongly with actual fitness.
Samples of 200 genomes nearly maximize inference accuracy.
Method can predict future genotype inheritance in populations.
Abstract
Mounting evidence suggests that natural populations can harbor extensive fitness diversity with numerous genomic loci under selection. It is also known that genealogical trees for populations under selection are quantifiably different from those expected under neutral evolution and described statistically by Kingman's coalescent. While differences in the statistical structure of genealogies have long been used as a test for the presence of selection, the full extent of the information that they contain has not been exploited. Here we shall demonstrate that the shape of the reconstructed genealogical tree for a moderately large number of random genomic samples taken from a fitness diverse, but otherwise unstructured asexual population can be used to predict the relative fitness of individuals within the sample. To achieve this we define a heuristic algorithm, which we test in silico…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolution and Genetic Dynamics · Genetic diversity and population structure · Genomics and Phylogenetic Studies
