# Integrating gene expression, genomic, and phosphoproteomic data to infer transcription factor activity in lung cancer

**Authors:** Chiara Carrino, Gerardo Pepe, Luca Parca, Manuela Helmer-Citterich, Pier Federico Gherardini

PMC12123410 · DOI: 10.1093/nargab/lqaf068 · NAR Genomics and Bioinformatics · 2025-05-30

## TL;DR

This study integrates gene expression, genomic, and phosphoproteomic data to identify transcription factors with perturbed activity in lung adenocarcinoma and to relate that activity to patient survival.

## Contribution

The paper introduces a data integration approach that combines genomic, transcriptomic, and phosphoproteomic data to infer patient-specific transcription factor activity in lung adenocarcinoma.

## Key findings

- Of 1667 human transcription factors screened, 34 showed perturbed activity in the lung adenocarcinoma samples, as evidenced by the expression of their direct targets.
- Phosphoproteomic data from the same samples were used to identify phosphorylation events that modulate transcription factor activity.
- ERG was identified as a key regulator in lung adenocarcinoma whose activity strongly correlates with patient survival.

## Abstract

Transcription factors (TFs) are key regulators of cellular gene expression programs in health and disease. Here we set out to integrate genomic, transcriptomic, and phosphoproteomic data to characterize TF activity in lung adenocarcinoma patients. Using expression data from patient samples and genomic information on TF binding to super-enhancers, starting from a list of 1667 human TFs we calculated a patient-specific activity score and identified 34 with perturbed activity in the cancer samples, as evidenced by the expression of their direct targets. We then leveraged phosphoproteomic data on the same samples to identify phosphorylation events that modulate TF activity. This novel data integration approach to TF characterization led to the identification of ERG as a key regulator in lung adenocarcinoma whose activity strongly correlates with patient survival.
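To make the idea of a patient-specific activity score more concrete, here is a minimal sketch of one common way such a score can be derived from the expression of a TF's direct targets: z-score each gene across samples, then average the z-scores of the TF's targets within each sample. This summary does not give the paper's actual scoring formula, so the averaging scheme is an assumption, and the function names (`zscore_by_gene`, `tf_activity`), gene names, TF name, and sample names are hypothetical placeholders.

```python
from statistics import mean, pstdev


def zscore_by_gene(expr):
    """Standardize each gene across samples.

    expr maps gene -> {sample: expression value}; every gene is assumed
    to be measured in the same set of samples.
    """
    z = {}
    for gene, by_sample in expr.items():
        values = list(by_sample.values())
        mu, sd = mean(values), pstdev(values)
        # A gene with no variance carries no information; score it as 0.
        z[gene] = {s: (v - mu) / sd if sd > 0 else 0.0
                   for s, v in by_sample.items()}
    return z


def tf_activity(expr, targets):
    """Per-sample activity: mean z-score of each TF's target genes.

    targets maps TF -> list of target genes (assumed to come from an
    external resource such as TF binding annotations).
    """
    z = zscore_by_gene(expr)
    samples = sorted({s for by_sample in expr.values() for s in by_sample})
    scores = {}
    for tf, genes in targets.items():
        measured = [g for g in genes if g in z]
        if not measured:
            continue  # no measured targets, so no score for this TF
        scores[tf] = {s: mean(z[g][s] for g in measured) for s in samples}
    return scores


# Toy data: placeholder values, not taken from the paper.
expr = {
    "GENE_A": {"tumor_1": 8.0, "tumor_2": 7.0, "normal_1": 2.0, "normal_2": 3.0},
    "GENE_B": {"tumor_1": 6.0, "tumor_2": 9.0, "normal_1": 1.0, "normal_2": 2.0},
    "GENE_C": {"tumor_1": 4.0, "tumor_2": 4.5, "normal_1": 5.0, "normal_2": 4.0},
}
targets = {"TF_X": ["GENE_A", "GENE_B"]}

activity = tf_activity(expr, targets)
for sample, score in activity["TF_X"].items():
    print(sample, round(score, 2))
```

In this toy example the hypothetical `TF_X` scores higher in the tumor samples than in the normal ones because both of its targets are more highly expressed there. A real analysis would also need a statistical test to decide which TFs count as perturbed, which this sketch leaves out.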

## Linked entities

- **Genes:** ERG (ETS transcription factor ERG) [NCBI Gene 2078]
- **Diseases:** lung adenocarcinoma (MONDO:0005061)

## Full-text entities

- **Genes:** ERG (ETS transcription factor ERG) [NCBI Gene 2078] {aka LMPHM14, erg-3, p55}, F3 (coagulation factor III, tissue factor) [NCBI Gene 2152] {aka CD142, TF, TFA}
- **Diseases:** lung adenocarcinoma (MESH:D000077192), cancer (MESH:D009369), lung cancer (MESH:D008175)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12123410/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12123410/full.md

## References

57 references — full list in the complete paper: https://tomesphere.com/paper/PMC12123410/full.md

---
Source: https://tomesphere.com/paper/PMC12123410