# Evolutionary turnover of key amino acids explains conservation of function without conservation of sequence in transcriptional activation domains

**Authors:** Claire J. LeBlanc, Jordan Stefani, Melvin Soriano, Angelica W. Y. Lam, Marissa A. Zintel, Sanjana R. Kotha, Emily P. Chase, Giovani Pimentel-Solorio, Aditya Vunnum, Gean Hu, Katherine L. Flug, Aaron Fultineer, Niklas Hummel, Max V. Staller, Michael Guertin, Michael Guertin, Michael Guertin

PMC · DOI: 10.1371/journal.pgen.1012069 · 2026-03-16

## TL;DR

This study shows how transcription factors can maintain their function over long evolutionary times despite significant changes in their protein sequences.

## Contribution

The paper reveals that evolutionary turnover of key amino acids in disordered regions maintains function without sequence conservation.

## Key findings

- The central activation domain of Gcn4 shows strong functional conservation despite sequence divergence.
- Evolutionary turnover of acidic and aromatic residues and short linear motifs contributes to functional conservation.
- Turnover of entire activation domains in full-length transcription factors supports functional stability over time.

## Abstract

In folded protein domains, protein function is frequently more conserved than amino acid sequence because highly diverged sequences can fold into equivalent 3D structures with identical function. During evolution, intrinsically disordered protein regions (IDRs) often experience rapid amino acid sequence divergence, but because they do not fold into stable 3D structures, it remains largely unknown when and how function is conserved. As a model system for studying the evolution of IDRs, we examined transcriptional activation domains, the regions of transcription factors that bind to coactivator complexes. We systematically identified activation domains on 502 homologs of the transcriptional activator Gcn4 spanning 600 MY of fungal evolution in the Ascomycota. We found that the central activation domain shows strong conservation of function without conservation of sequence. This conservation of function without conservation of sequence arises from evolutionary turnover (gain and loss) at two length scales. Within the central activation domain, we see turnover of acidic and aromatic residues, but primarily loss of short linear motifs. In the full-length transcription factor, we see turnover of entire activation domains. Stabilizing selection and evolutionary turnover at multiple length scales are likely a general mechanism for conservation of function without conservation of sequence in IDRs.

When and where genes are turned on determine what an organism looks like and how it responds to its environment. The turning on and off of genes is controlled by proteins known as transcription factors. Transcription factors have two main jobs: to bind the genome and to turn genes on or off. The protein regions of the transcription factors that bind the genome have been conserved for billions of years. However, throughout this time the rest of the transcription factor protein sequence has changed substantially. In this study, we investigate how changes in the sequences of these other regions impact the ability of transcription factors to turn on genes. We find that these transcription factor regions are still able to turn on genes despite large changes in protein sequence. The function of these transcription factors is much more conserved than their protein sequences. The protein sequence of these activating regions can change dramatically while maintaining function. These findings improve our understanding of how these transcription factors turn on genes and how they are evolving.

## Linked entities

- **Genes:** GCN4 (amino acid starvation-responsive transcription factor GCN4) [NCBI Gene 856709]
- **Species:** Ascomycota (taxon 4890)

## Full-text entities

- **Genes:** GCN4 (amino acid starvation-responsive transcription factor GCN4) [NCBI Gene 856709] {aka AAS101, AAS3, ARG9}, PCL5 (Pcl5p) [NCBI Gene 856468], PDR1 (drug-responsive transcription factor PDR1) [NCBI Gene 852871] {aka AMY1, ANT1, BOR2, CYH3, NRA2, SMR2}, PHO85 (cyclin-dependent serine/threonine-protein kinase PHO85) [NCBI Gene 856076] {aka LDB15}, PHO80 (Pho80p) [NCBI Gene 854161] {aka AGS3, TUP7, VAC5}, GAL11 (Gal11p) [NCBI Gene 854106] {aka ABE1, MED15, RAR3, SDS4, SPT13}
- **Diseases:** AD (MESH:D000544), intrinsic disorder (MESH:D020919), ADs (OMIM:612348), F (OMIM:102510)
- **Chemicals:** SC (MESH:D012538), water (MESH:D014867), P (MESH:D010758), Lithium Acetate (MESH:C488804), nitrogen (MESH:D009584), Glucose (MESH:D005947), proline (MESH:D011392), S (MESH:D013455), Aspartic acid (MESH:D001224), glutamic acid (MESH:D018698), L (MESH:D007930), amino acid (MESH:D000596), F (MESH:D005461), G418 (MESH:C010680), Glycerol (MESH:D005990), SP (MESH:C000604007), Amp (MESH:D000249), Methionine (MESH:D008715), T (MESH:D014316), chloroform (MESH:D002725), dipeptides (MESH:D004151), phenylalanine (MESH:D010649), HF (MESH:D006195), ADs (-)
- **Species:** Catenaria anguillulae (species) [taxon 109876], Aspergillus westerdijkiae (species) [taxon 357447], Saccharomyces cerevisiae (baker's yeast, species) [taxon 4932], Escherichia coli (E. coli, species) [taxon 562], Aspergillus tamarii (species) [taxon 41984], Drosophila melanogaster (fruit fly, species) [taxon 7227], Tortispora caseinolytica (species) [taxon 51930], Homo sapiens (human, species) [taxon 9606], Dendryphion nanum (species) [taxon 256645], Didymocrea sadasivanii (species) [taxon 372059], Mus musculus (house mouse, species) [taxon 10090]
- **Mutations:** T105, S24C, S24G, T105P, S24F, S24A, S23E, S23D, S24D, S23, leucine residues were mutated to alanine, S24E, S24H

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13004512/full.md

---
Source: https://tomesphere.com/paper/PMC13004512