Category theory for genetics II: genotype, phenotype and haplotype
R\'emy Tuy\'eras

TL;DR
This paper develops an algebraic framework using category theory to analyze genetic structures like haplotypes and phenotypes, enabling better understanding of genetic interactions and population genetics.
Contribution
It introduces a novel algebraic approach based on idempotent commutative monoids and pedigrads to model genetic relationships and recombination processes.
Findings
Framework clarifies haplotype-phenotype associations.
Enables algebraic analysis of linkage disequilibrium.
Supports modeling of genetic mechanisms over generations.
Abstract
The overarching goal of this paper is to solve the word problem for a class of idempotent commutative monoids whose elements model population haplotypes. More specifically, we design an algebraic framework in which it is possible to unravel population stratification relationships and infer linkage disequilibrium in terms of algebraic equations of haplotypes expressed in idempotent commutative monoids. We show how these relations can be used to clarify haplotype-phenotype associations through the consideration of intermediate phenotypes and genetic mechanisms such as segregation and homologous recombination. The present work paves the way for the implementation of combinatorial GWAS in the study of complex traits, and for a framework in which one can infer genetic variants interactions along with the corresponding regulatory circuitry. Throughout the paper, we formalize concepts such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Biomedical Text Mining and Ontologies · Gene expression and cancer classification
