# Widespread turnover of a conserved cis-regulatory code across 589 grass species

**Authors:** Charles O Hale, Sheng-Kai Hsu, Jingjing Zhai, Aimee Schulz, Taylor Aubuchon-Elder, Germano Costa-Neto, Allen Gelfond, Mohamed Z El-Walid, Matthew Hufford, Elizabeth A Kellogg, Thuy La, Alexandre P Marand, Arun S Seetharam, Armin Scheben, Michelle C Stitzer, Travis Wrightsman, Maria Cinta Romay, Edward S Buckler

PMC · DOI: 10.1093/molbev/msaf324 · Molecular Biology and Evolution · 2025-12-10

## TL;DR

This study explores how regulatory DNA sequences in grasses have evolved over millions of years, revealing a conserved regulatory code with widespread changes in specific binding sites.

## Contribution

The study introduces a novel approach combining comparative genomics and phylogenetic mixed models to uncover cis-regulatory evolution across 589 grass species.

## Key findings

- 73% of identified cis-regulatory motifs are conserved across five diverse grass species.
- Shared motif occupancy declines rapidly with divergence time but remains for ∼50% back to the origin of grasses.
- Motif gains and losses are associated with ecological niche transitions, including HSF/GARP motifs linked to temperate environments.

## Abstract

The growing availability of genomes from non-model organisms offers new opportunities to identify functional loci underlying trait variation through comparative genomics. While cis-regulatory regions drive much of phenotypic evolution, linking them to specific functions remains challenging. We identified 514 cis-regulatory motifs enriched in regulatory regions of five diverse grass species, with 73% consistently enriched across all, suggesting a deeply conserved regulatory code. Leveraging 57 new contig-level genome assemblies, we then quantified shared occupancy of specific motif instances within gene-proximal regions across 589 grass species, revealing widespread gain and loss over evolutionary time. Shared occupancy declined rapidly over the first few million years of divergence, yet ∼50% of motif instances were shared back to the origin of grasses ∼100 million years ago. We used phylogenetic mixed models to identify motif gains and losses associated with ecological niche transitions. Our models revealed significant environmental associations across 1282 motif–orthogroup combinations, including convergent gains of HSF/GARP motifs at an alpha-N-acetylglucosaminidase gene associated with occurrence in temperate environments. Our findings support a “stable motifs, variable binding sites” model in which cis-regulatory evolution involves turnover of thousands of individual binding site instances while largely preserving transcription factors’ binding preferences. Our results highlight the potential of comparative genomics and phylogenetic mixed models to reveal the genetic basis of complex traits.

## Full-text entities

- **Genes:** NAGLU (N-acetyl-alpha-glucosaminidase) [NCBI Gene 4669] {aka CMT2V, MPS-IIIB, MPS3B, NAG, UFHSD}

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12819352/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12819352/full.md

## References

117 references — full list in the complete paper: https://tomesphere.com/paper/PMC12819352/full.md

---
Source: https://tomesphere.com/paper/PMC12819352