Long-term evolution of regulatory DNA sequences. Part 2: Theory and future challenges
Elia Mascolo, R\'eka Borb\'ely, Noa Ottilie Borst, Nicholas H Barton, Justin Crocker, Ga\v{s}per Tka\v{c}ik

TL;DR
This paper reviews the long-term evolution of regulatory DNA sequences, focusing on theoretical models, evolutionary concepts, and future challenges in understanding how gene regulatory elements evolve and adapt over time.
Contribution
It synthesizes existing models and concepts to evaluate the potential for a unifying theory of regulatory sequence evolution and highlights key open challenges.
Findings
Insights into the evolution of CREs and their regulatory functions.
Evaluation of the connectivity and evolvability of regulatory sequences.
Identification of open challenges in modeling long-term regulatory evolution.
Abstract
Promoters and enhancers are cis-regulatory elements (CREs), DNA sequences that bind transcription factor (TF) proteins to up- or down-regulate target genes. Decades-long efforts yielded TF-DNA interaction models that predict how strongly an individual TF binds arbitrary DNA sequences and how individual binding events on the CRE combine to affect gene expression. These insights can be synthesized into a global, biophysically-realistic, and quantitative genotype-phenotype (GP) map for gene regulation, a "holy grail" for the application of evolutionary theory. A global map provides a rare opportunity to simulate long-term evolution of regulatory sequences and pose several fundamental questions: How long does it take to evolve CREs de novo? How many non-trivial regulatory functions exist in sequence space? How connected are they? For which regulatory architecture is CRE evolution most rapid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Chromatin Dynamics · DNA and Nucleic Acid Chemistry · Gene Regulatory Network Analysis
