# ENcyclopedia of TRAnscription Factors in Bacteria and Archaea genomes (ENTRAF) version 2.0

**Authors:** Silvia Tenorio-Salgado, Cinthia Rodríguez Maya, Edgardo Galan-Vasquez, André Borges Farias, Dulce Álvarez-López, Jose Luis Villalpando-Aguilar, Alberto J Martin, Leonardo Ledesma-Dominguez, Ernesto Perez-Rueda

PMC · DOI: 10.1093/database/baaf071 · Database: The Journal of Biological Databases and Curation · 2025-10-29

## TL;DR

The paper introduces ENTRAF 2.0, a database of experimentally validated transcription factors in bacteria and archaea, providing detailed annotations and regulatory information.

## Contribution

The novel contribution is the collection and organization of 1784 experimentally validated TFs across multiple bacterial and archaeal phyla into a relational database.

## Key findings

- The database includes regulatory roles such as carbon assimilation, virulence, and antibiotic resistance.
- It provides structural and functional classifications of TFs, enhancing predictions in genome analysis.

## Abstract

DNA-binding transcription factors (TFs) have a central role in regulation of gene expression at the transcription initiation level. These proteins have been experimentally described in multiple bacterial and archaeal genomes. These descriptions have allowed their prediction in complete genomes. In this work, we collected 1784 experimentally validated TFs across 25 bacterial and seven archaeal phyla, including Gammaproteobacteria, Bacillota, and Actinomycetota in bacteria and Thermoproteota and Thermococci in archaea. The collection of regulatory proteins was organized into a relational database, named ENcyclopedia of TRAnscription Factors in Bacteria and Archaea genomes or ENTRAF. The database shows the experimental evidence for all the TFs [protein structure information (X-ray or NMR structural data); binding of purified proteins; footprinting assays; site mutation; in vitro transcription assay; and PRiMer extension analysis, among others], their global regulatory roles (carbon source assimilation, virulence, antibiotic resistance, stress, and DNA damage), evolutionary families, and structural classifications. In addition, we achieved a global description of the collection in terms of their regulatory mechanisms (activation, repression, and dual activities), structural diversity, functional categories, and protein families. We consider that this collection of well-annotated TFs could be used as a benchmark, enhancing the predictions for this class of proteins in complete genomes. The complete collection of TFs is available at https://entraf.iimas.unam.mx and https://github.com/BioIIMAS/ENTRAF.

## Linked entities

- **Species:** Gammaproteobacteria (taxon 1236), Bacillota (taxon 1239), Actinomycetota (taxon 201174), Thermoproteota (taxon 28889), Thermococci (taxon 183968)

## Full-text entities

- **Chemicals:** carbon (MESH:D002244)
- **Species:** Thermococci (class) [taxon 183968], Thermoproteota (phylum) [taxon 28889]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12569306/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12569306/full.md

## References

82 references — full list in the complete paper: https://tomesphere.com/paper/PMC12569306/full.md

---
Source: https://tomesphere.com/paper/PMC12569306