# Integrative GWAS and RNA-Seq analysis for target identification and virtual drug screening in colorectal cancer

**Authors:** Qinghui Liu, Yiyang Lei, Zixuan Liu, Jiale Han, Zhengrui Li, Zhengrui Li, Zhengrui Li, Zhengrui Li

PMC · DOI: 10.1371/journal.pone.0333179 · 2025-10-27

## TL;DR

This study combines genetic and RNA data to identify key genes in colorectal cancer and finds potential drug targets, including PYGL, for new therapies.

## Contribution

The novel integration of GWAS and RNA-seq data identifies CRC-associated genes and prioritizes PYGL as a druggable target through virtual screening.

## Key findings

- 24 CRC-associated genes, including PYGL, SMAD7, and TCF7L2, are involved in tumor metabolism and Wnt/TCF signaling.
- Five genes (CDKN2B, BOC, METRNL, etc.) are significantly correlated with survival outcomes in CRC patients.
- Ten small-molecule candidates targeting PYGL show high binding affinity, suggesting therapeutic potential.

## Abstract

Colorectal cancer (CRC) is a leading cause of global cancer-related mortality, necessitating the identification of novel therapeutic targets. Integrating genetic and transcriptomic data may reveal key molecular drivers of CRC progression and treatment opportunities.

We performed a multiomics analysis combining genome-wide association study (GWAS) data (p < 1e-6) and RNA-seq data from the TCGA. Differential expression analysis (Limma) identified 24 consistently dysregulated genes (17 mRNAs, 7 lncRNAs) in CRC. Survival analysis was used to evaluate their prognostic impact on overall survival (OS), relapse-free survival (RFS), and post progression survival (PPS). Drug‒gene interactions were explored via Enrichr, and virtual screening (PubChem) prioritized high-affinity compounds that target PYGL, a metabolic regulator.

Integration of GWAS and RNA-seq revealed that 24 CRC-associated genes, including PYGL, SMAD7, and TCF7L2, are involved in tumor metabolism and Wnt/TCF signaling. Survival analysis revealed that five genes (CDKN2B, BOC, METRNL, etc.) were significantly correlated with OS, RFS, and PPS. Ten small-molecule candidates targeting PYGL exhibited high binding affinity, suggesting their therapeutic potential.

This study identified CRC-linked genes through GWASs and transcriptomics, highlighting their prognostic and druggable relevance. Computational drug repurposing pinpoints PYGL inhibitors as promising candidates, offering a translational framework for CRC therapy development.

## Linked entities

- **Genes:** PYGL (glycogen phosphorylase L) [NCBI Gene 5836], SMAD7 (SMAD family member 7) [NCBI Gene 4092], TCF7L2 (transcription factor 7 like 2) [NCBI Gene 6934], CDKN2B (cyclin dependent kinase inhibitor 2B) [NCBI Gene 1030], BOC (BOC cell adhesion associated, oncogene regulated) [NCBI Gene 91653], METRNL (meteorin like, glial cell differentiation regulator) [NCBI Gene 284207]
- **Diseases:** colorectal cancer (MONDO:0005575)

## Full-text entities

- **Genes:** HNF4A (hepatocyte nuclear factor 4 alpha) [NCBI Gene 3172] {aka FRTS4, HNF4, HNF4a7, HNF4a8, HNF4a9, HNF4alpha}, SMAD7 (SMAD family member 7) [NCBI Gene 4092] {aka CRCS3, MADH7, MADH8}, TCF7L2 (transcription factor 7 like 2) [NCBI Gene 6934] {aka TCF-4, TCF4}, PYGL (glycogen phosphorylase L) [NCBI Gene 5836] {aka GSD6}, METRNL (meteorin like, glial cell differentiation regulator) [NCBI Gene 284207], BOC (BOC cell adhesion associated, oncogene regulated) [NCBI Gene 91653] {aka Boi, CDON2}, CDKN2B (cyclin dependent kinase inhibitor 2B) [NCBI Gene 1030] {aka CDK4I, INK4B, MTS2, P15, TP15, p15INK4b}
- **Diseases:** cancer (MESH:D009369), CRC (MESH:D015179)

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12558462/full.md

---
Source: https://tomesphere.com/paper/PMC12558462