The common origin of symmetry and structure in genetic sequences
Giampaolo Cristadoro, Mirko Degli Esposti, Eduardo G. Altmann

TL;DR
This paper uncovers hierarchical symmetries in genetic sequences, linking non-random structures and symmetry through a simple domain model, supported by analysis of the human genome.
Contribution
It reveals previously unknown symmetries in genetic sequences and explains them with a domain model considering mobile elements, unifying two statistical features.
Findings
Hierarchical symmetries organized across scales in genetic sequences.
Confirmation of symmetries through analysis of the human genome.
Explanation of structures and symmetries via a simple domain model.
Abstract
Biologists have long sought a way to explain how statistical properties of genetic sequences emerged and are maintained through evolution. On the one hand, non-random structures at different scales indicate a complex genome organisation. On the other hand, single-strand symmetry has been scrutinised using neutral models in which correlations are not considered or irrelevant, contrary to empirical evidence. Different studies investigated these two statistical features separately, reaching minimal consensus despite sustained efforts. Here we unravel previously unknown symmetries in genetic sequences, which are organized hierarchically through scales in which non-random structures are known to be present. These observations are confirmed through the statistical analysis of the human genome and explained through a simple domain model. These results suggest that domain models which account…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
