Inverse population genetic problems with noise: inferring extent and structure of haplotype blocks from point allele frequencies
Oliver Keatinge Clay

TL;DR
This paper addresses inverse genetic problems by inferring haplotype block boundaries and structures from allele frequency data, using Boolean string models to handle noisy, biallelic genome information.
Contribution
It introduces a novel framework for inferring haplotype block extent and structure solely from allele frequencies, applying Boolean models to noisy genomic data.
Findings
Framework effectively infers block boundaries from allele frequencies.
Boolean string models simplify analysis of biallelic genome data.
Application to human chromosome demonstrates practical utility.
Abstract
A haplotype block, or simply a block, is a chromosomal segment, DNA base sequence or string that occurs in only a few variants or types in the genomes of a population of interest, and that has an encapsulated or 'private' frequency distribution of the string types that is not shared by neighbouring blocks or regions on the same chromosome. We consider two inverse problems of genetic interest: from just the frequencies of the symbol types (4 base types, possible single-base alleles) at each position (point, base/nucleotide) along the string, infer the location of the left and right boundaries of the block (block extent), and the number and relative frequencies of the string types occurring in the block (block structure). The large majority of variable positions in human and also other (e.g., fungal) genomes appear to be biallelic, i.e., the position allows only a choice between two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFractal and DNA sequence analysis · Gene Regulatory Network Analysis · RNA and protein synthesis mechanisms
MethodsBalanced Selection
