Formation of regulatory modules by local sequence duplication
Armita Nourmohammad, Michael Laessig

TL;DR
This study reveals that local sequence duplications are a major mechanism for the rapid formation and evolution of regulatory modules in the fly genome, contributing to the complexity of higher eukaryotic gene regulation.
Contribution
It demonstrates that local sequence duplication is a prevalent mode of regulatory site formation, shaping the architecture of cis-regulatory modules in higher eukaryotes.
Findings
Neighboring transcription factor binding sites often originate from common sequences via duplication.
Duplications can seed new regulatory sites near existing ones, which then evolve through point mutations.
Significant sequence similarity among binding site pairs extends up to 50 bp in fly regulatory modules.
Abstract
Turnover of regulatory sequence and function is an important part of molecular evolution. But what are the modes of sequence evolution leading to rapid formation and loss of regulatory sites? Here, we show that a large fraction of neighboring transcription factor binding sites in the fly genome have formed from a common sequence origin by local duplications. This mode of evolution is found to produce regulatory information: duplications can seed new sites in the neighborhood of existing sites. Duplicate seeds evolve subsequently by point mutations, often towards binding a different factor than their ancestral neighbor sites. These results are based on a statistical analysis of 346 cis-regulatory modules in the Drosophila melanogaster genome, and a comparison set of intergenic regulatory sequence in Saccharomyces cerevisiae. In fly regulatory modules, pairs of binding sites show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
