GLaDiGAtor: Language-Model-Augmented Multi-Relation Graph Learning for Predicting Disease-Gene Associations
Osman Onur Kuzucu, Tunca Do\u{g}an

TL;DR
GLaDiGAtor is a novel graph neural network framework that integrates biological data and language model features to accurately predict disease-gene associations, aiding biomedical research and drug discovery.
Contribution
It introduces a heterogeneous graph model with language model-enriched features for disease-gene prediction, outperforming existing methods.
Findings
Achieves superior predictive accuracy over 14 existing methods.
Successfully identifies biologically relevant novel disease-gene links.
Demonstrates potential to facilitate drug discovery.
Abstract
Understanding disease-gene associations is essential for unravelling disease mechanisms and advancing diagnostics and therapeutics. Traditional approaches based on manual curation and literature review are labour-intensive and not scalable, prompting the use of machine learning on large biomedical data. In particular, graph neural networks (GNNs) have shown promise for modelling complex biological relationships. To address limitations in existing models, we propose GLaDiGAtor (Graph Learning-bAsed DIsease-Gene AssociaTiOn pRediction), a novel GNN framework with an encoder-decoder architecture for disease-gene association prediction. GLaDiGAtor constructs a heterogeneous biological graph integrating gene-gene, disease-disease, and gene-disease interactions from curated databases, and enriches each node with contextual features from well-known language models (ProtT5 for protein sequences…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Advanced Graph Neural Networks · Genetic Associations and Epidemiology
