# GRAFT: a graph-aware fusion transformer for cancer driver gene prediction

**Authors:** Sang-Pil Cho, Young-Rae Cho

PMC · DOI: 10.1093/bib/bbaf706 · Briefings in Bioinformatics · 2026-01-11

## TL;DR

GRAFT is a new model that improves cancer driver gene prediction by combining multiple biological data sources and capturing both local and global molecular dependencies.

## Contribution

GRAFT introduces a graph-aware fusion transformer with edge-attention bias for multimodal integration in cancer driver gene discovery.

## Key findings

- GRAFT achieves competitive performance with state-of-the-art methods in pan-cancer analysis.
- The model shows superior predictive accuracy across multiple specific cancer types.
- Predicted candidate driver genes are biologically relevant and associated with cancer-related processes.

## Abstract

Identifying cancer driver genes is essential for precision oncology, but existing computational methods are often limited by their reliance on single biological networks and their inability to capture long-range molecular dependencies. To address these challenges, we propose GRAFT, a Graph-Aware Fusion Transformer. This framework learns modality-specific features from protein-protein interactions, pathway co-occurrence, and gene semantic similarity using a multi-view graph encoder. These representations are further enriched with two auxiliary feature types: structural encodings derived from network topology and functional embeddings guided by curated gene sets. The integrated features are then processed by a transformer backbone, where a novel edge-attention bias makes the model explicitly sensitive to the underlying graph topologies, enabling the effective modeling of both local and global dependencies. Extensive evaluations demonstrate that GRAFT achieves competitive performance with leading state-of-the-art methods in pan-cancer analysis, while consistently delivering superior predictive accuracy across numerous specific cancer types. More importantly, a functional enrichment analysis of the novel candidate driver genes predicted by our model confirms their strong associations with key cancer-related processes, demonstrating the model’s ability to make biologically plausible discoveries. By delivering a powerful and interpretable framework, our model not only advances the identification of cancer driver genes but also establishes a robust paradigm for multimodal data integration in systems biology. The source codes and datasets are publicly accessible at https://github.com/spcho-dev/GRAFT.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Diseases:** cancer (MESH:D009369)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12790624/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12790624/full.md

## References

50 references — full list in the complete paper: https://tomesphere.com/paper/PMC12790624/full.md

---
Source: https://tomesphere.com/paper/PMC12790624