Decomposing the site frequency spectrum: the impact of tree topology on neutrality tests
Luca Ferretti, Alice Ledda, Guillaume Achaz, Thomas Wiehe and, Sebastian E. Ramos-Onsins

TL;DR
This paper analyzes how the shape and topology of genealogical trees influence population genetic statistics and neutrality tests, providing new insights and methods for interpreting these tests in evolutionary studies.
Contribution
It decomposes neutrality test statistics into components related to tree topology and waiting times, clarifies their impact, and introduces a new test for selection based on tree shape.
Findings
Values of Tajima's D depend on tree root balance.
A new test for selection is proposed and interpreted.
Extreme values of neutrality tests are characterized mathematically.
Abstract
We investigate the dependence of the site frequency spectrum (SFS) on the topological structure of genealogical trees. We show that basic population genetic statistics - for instance estimators of or neutrality tests such as Tajima's - can be decomposed into components of waiting times between coalescent events and of tree topology. Our results clarify the relative impact of the two components on these statistics. We provide a rigorous interpretation of positive or negative values of an important class of neutrality tests in terms of the underlying tree shape. In particular, we show that values of Tajima's and Fay and Wu's depend in a direct way on a peculiar measure of tree balance which is mostly determined by the root balance of the tree. We present a new test for selection in the same class as Fay and Wu's and discuss its interpretation and power. Finally,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
