Individual-Level SNP Diversity and Similarity Profiles
Zhanshan (Sam) Ma, Lianwei Li, Ya-Ping Zhang

TL;DR
This paper introduces a novel method for measuring individual-level SNP diversity using Hill numbers derived from Renyi entropy, providing a unified framework that complements existing population-based diversity metrics and aids in comparative genetic analysis.
Contribution
It proposes a new SNP diversity measure at the individual level based on Renyi-entropy-derived Hill numbers, unifying various diversity indexes and enabling better comparison and analysis.
Findings
Introduces SNP diversity profiles for individuals.
Unifies multiple diversity indexes through profiles.
Profiles can be estimated with rarefaction to address sequencing coverage issues.
Abstract
Classic concepts of genetic (gene) diversity (heterozygosity) such as Nei (1973: PNAS) and Nei and Li (1979: PNAS) nucleotide diversity were defined within the context of populations. Although variations are often measured in population context, the basic carriers of variation are individuals. Hence, measuring variations such as SNP of individual against a reference genome, which has been ignored currently, is certainly of its own right. Indeed, similar practice has been a tradition in ecology, where the basic framework of diversity measure is individual community sample. We propose to use Renyi-entropy-derived Hill numbers to define SNP (single nucleotide polymorphism) diversity (including alpha-, beta-, and gamma-diversities) and similarity profiles. Hill numbers are derived from Renyi entropy, of which Shannon entropy is a special case and which have found widely applications…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Genetic and phenotypic traits in livestock · Genetic Mapping and Diversity in Plants and Animals
