# Remote homology identification of the Drosophila melanogaster ortholog of the RNA Polymerase I subunit Rpa34/POLR1G

**Authors:** Ryan Palumbo, Bruce Knutson

PMC · DOI: 10.17912/micropub.biology.001107 · microPublication Biology · 2024-01-12

## TL;DR

This paper identifies a missing RNA polymerase subunit in fruit flies using structural and evolutionary analysis.

## Contribution

CG11076 is identified as the Drosophila ortholog of Rpa34/POLR1G using remote homology and structural analysis.

## Key findings

- CG11076 shows high structural conservation with Rpa34/POLR1G.
- Phylogenetic analysis confirms CG11076 is closely related to Rpa34/POLR1G.
- Combining sequence and structure improves identification of divergent orthologs.

## Abstract

Highly conserved orthologous proteins are easily identified by sequence homology alone, whereas poorly conserved orthologs require additional structural information to be identified. All
Drosophila 
orthologs of RNA polymerase I, II, and III subunits—except one—have been identified by sequence homology. Here, we identified CG11076 as the missing Rpa34/POLR1G ortholog in
Drosophila
. Remote homology detection and secondary structure analysis showed that CG11076 is predicted to have high structural conservation with Rpa34/POLR1G, and phylogenetic analysis demonstrated that these proteins are closely related. Our work underscores the importance of utilizing both sequence and structure to identify highly divergent orthologous proteins in different species.

## Linked entities

- **Genes:** CG11076 (uncharacterized protein) [NCBI Gene 43830], POLR1G (RNA polymerase I subunit G) [NCBI Gene 10849], POLR1G (RNA polymerase I subunit G) [NCBI Gene 10849]
- **Species:** Drosophila (taxon 7215), Drosophila melanogaster (taxon 7227)

## Full-text entities

- **Genes:** GTF2F2P1 (general transcription factor IIF subunit 2 pseudogene 1) [NCBI Gene 2964] {aka GTF2F2L, TFIIF}, POLI (DNA polymerase iota) [NCBI Gene 11201] {aka RAD30B, RAD3OB, eta2}, RPA34 (DNA-directed RNA polymerase I subunit RPA34) [NCBI Gene 853293] {aka CST21}, POLR3D (RNA polymerase III subunit D) [NCBI Gene 661] {aka BN51T, C53, RPC4, RPC53, TSBN51}, RPA49 (DNA-directed RNA polymerase I subunit RPA49) [NCBI Gene 855473], Polr1A (RNA polymerase I subunit A) [NCBI Gene 36617] {aka 153129_at, CG10122, DmRPA1, Dmel\CG10122, RPA1, RPA190}, RPC53 (DNA-directed RNA polymerase III subunit C53) [NCBI Gene 851404] {aka RPC4}, POLR1E (RNA polymerase I subunit E) [NCBI Gene 64425] {aka A49, PAF53, PRAF1, RPA49}, CG11076 (uncharacterized protein) [NCBI Gene 43830] {aka Dmel\CG11076}, TFG2 (transcription factor IIF subunit TFG2) [NCBI Gene 852888], POLR1G (RNA polymerase I subunit G) [NCBI Gene 10849] {aka ASE-1, ASE1, CAST, CD3EAP, PAF49, RPA34}
- **Species:** Saccharomyces cerevisiae (baker's yeast, species) [taxon 4932], Homo sapiens (human, species) [taxon 9606], Drosophila melanogaster (fruit fly, species) [taxon 7227], Nakaseomyces glabratus (species) [taxon 5478]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC10823793/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC10823793/full.md

## References

15 references — full list in the complete paper: https://tomesphere.com/paper/PMC10823793/full.md

---
Source: https://tomesphere.com/paper/PMC10823793