# ONCOPLEX: an oncology-inspired hypergraph model integrating diverse biological knowledge for cancer driver gene prediction

**Authors:** Etab Mohammed Alotaibi, Omer S. Alkhnbashi, Van Dinh Tran

PMC · DOI: 10.1038/s41598-026-36127-8 · Scientific Reports · 2026-01-13

## TL;DR

ONCOPLEX is a new AI model that uses complex biological networks to better identify genes that cause cancer.

## Contribution

ONCOPLEX introduces a novel hypergraph-based framework integrating diverse biological data for improved cancer driver gene prediction.

## Key findings

- ONCOPLEX outperforms existing methods in identifying cancer driver genes across various cancer types.
- The model accurately recovers known driver genes and identifies new candidates supported by biological evidence.

## Abstract

Cancer development is driven by a small subset of somatic mutations, known as driver mutations, that disrupt key regulatory processes in cells. These mutations occur in specific genes, called cancer driver genes, whose altered functions promote tumor initiation and progression. Accurately identifying driver genes remains a major challenge due to their rarity and the overwhelming presence of passenger mutations. Recent advances in graph-based deep learning have improved the modeling of gene interactions, but most approaches are limited to pairwise connections and fail to capture the higher-order complexity of biological systems. We introduce ONCOPLEX, a hypergraph-based neural network framework that models genes as nodes and curated cancer-related pathways as hyperedges, enabling the representation of multi-gene interactions. Unlike previous methods, ONCOPLEX integrates diverse molecular and phenotypic features, such as somatic mutations, gene expression, and DNA methylation, into a pathway-informed hypergraph structure to learn biologically meaningful gene representations. ONCOPLEX is trained in a supervised manner on labeled driver and non-driver genes, with unlabeled genes included as nodes during representation learning. Comprehensive evaluations across pan-cancer and cancer-type-specific settings show that ONCOPLEX consistently outperforms state-of-the-art methods in classification and ranking metrics. It accurately recovers known driver genes and highlights novel candidates supported by literature and enrichment analyses. These findings underscore the power of pathway-guided hypergraph modeling for advancing cancer driver gene discovery.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Genes:** BRAF (B-Raf proto-oncogene, serine/threonine kinase) [NCBI Gene 673] {aka B-RAF1, B-raf, BRAF-1, BRAF1, NS7, RAFB1}, SRC (SRC proto-oncogene, non-receptor tyrosine kinase) [NCBI Gene 6714] {aka ASV, SRC1, THC6, c-SRC, p60-Src}, JUN (Jun proto-oncogene, AP-1 transcription factor subunit) [NCBI Gene 3725] {aka AP-1, AP1, c-Jun, cJUN, p39}, STAT3 (signal transducer and activator of transcription 3) [NCBI Gene 6774] {aka ADMIO, ADMIO1, APRF, HIES}, BRCA1 (BRCA1 DNA repair associated) [NCBI Gene 672] {aka BRCAI, BRCC1, BROVCA1, FANCS, IRIS, PNCA4}, PIK3R1 (phosphoinositide-3-kinase regulatory subunit 1) [NCBI Gene 5295] {aka AGM7, GRB1, IMD36, p85, p85-ALPHA, p85alpha}, GAB1 (GRB2 associated binding protein 1) [NCBI Gene 2549] {aka DFNB26}, AKT1 (AKT serine/threonine kinase 1) [NCBI Gene 207] {aka AKT, PKB, PKB-ALPHA, PRKBA, RAC, RAC-ALPHA}, UBA52 (ubiquitin A-52 residue ribosomal protein fusion product 1) [NCBI Gene 7311] {aka CEP52, HUBCEP52, L40, RPL40}, PIK3CB (phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit beta) [NCBI Gene 5291] {aka P110BETA, PI3K, PI3KBETA, PIK3C1}, SHC1 (SHC adaptor protein 1) [NCBI Gene 6464] {aka SHC, SHCA}, MAPK1 (mitogen-activated protein kinase 1) [NCBI Gene 5594] {aka ERK, ERK-2, ERK2, ERT1, MAPK2, NS13}, RUNX1 (RUNX family transcription factor 1) [NCBI Gene 861] {aka AML1, AML1-EVI-1, AMLCR1, CBF2alpha, CBFA2, EVI-1}, MTOR (mechanistic target of rapamycin kinase) [NCBI Gene 2475] {aka FRAP, FRAP1, FRAP2, RAFT1, RAPT1, SKS}, NRAS (NRAS proto-oncogene, GTPase) [NCBI Gene 4893] {aka ALPS4, CMNS, N-ras, NCMS, NRAS1, NS6}, MAPK3 (mitogen-activated protein kinase 3) [NCBI Gene 5595] {aka ERK-1, ERK1, ERT2, HS44KDAP, HUMKER1A, P44ERK1}, GSK3B (glycogen synthase kinase 3 beta) [NCBI Gene 2932], KRAS (KRAS proto-oncogene, GTPase) [NCBI Gene 3845] {aka 'C-K-RAS, C-K-RAS, CFC2, K-RAS2A, K-RAS2B, K-RAS4A}, EGFR (epidermal growth factor receptor) [NCBI Gene 1956] {aka ERBB, ERBB1, ERRP, HER1, NISBD2, NNCIS}, PRL (prolactin) [NCBI Gene 5617] {aka GHA1, pPRL}, GRB2 (growth factor receptor bound protein 2) [NCBI Gene 2885] {aka ASH, EGFRBP-GRB2, Grb3-3, MST084, MSTP084, NCKAP2}
- **Diseases:** THCA (MESH:D013964), HNSC (MESH:D000077195), LUSC (MESH:D002294), LIHC (MESH:D006528), Cancer (MESH:D009369), STAD (MESH:D013274), liver tumor (MESH:D008113), BLCA (MESH:D001749), PRAD (MESH:D000230), prostate cancer (MESH:D011471), chronic myeloid leukemia (MESH:D015464), death (MESH:D003643), oncogenes (MESH:D000074723), liver-related diseases (MESH:D008107), COAD (MESH:D029424), disease (MESH:D004194), ESCA (MESH:D004938), LUAD (MESH:D000077192), breast cancer (MESH:D001943)
- **Chemicals:** ONCOPLEX (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12881423/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12881423/full.md

## References

3 references — full list in the complete paper: https://tomesphere.com/paper/PMC12881423/full.md

---
Source: https://tomesphere.com/paper/PMC12881423