Information-theoretic characterization of the complete genotype-phenotype map of a complex pre-biotic world
Nitash C G, Christoph Adami (Michigan State University)

TL;DR
This paper uses information theory to analyze a vast genotype-phenotype map, revealing how biological information is robustly encoded and compressed across over 5 trillion sequences, advancing understanding of pre-biotic molecular information encoding.
Contribution
It provides the first comprehensive information-theoretic analysis of a complete genotype-phenotype map with over 5 trillion sequences, uncovering encoding patterns.
Findings
Revealed robust and compressed genetic encodings
Analyzed the largest complete genotype-phenotype map to date
Demonstrated the utility of information theory in biological sequence analysis
Abstract
How information is encoded in bio-molecular sequences is difficult to quantify since such an analysis usually requires sampling an exponentially large genetic space. Here we show how information theory reveals both robust and compressed encodings in the largest complete genotype-phenotype map (over 5 trillion sequences) obtained to date.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
