Comprehensive Metapath-based Heterogeneous Graph Transformer for Gene-Disease Association Prediction
Wentao Cui, Shoubo Li, Chen Fang, Qingqing Long, Chengrui Wang, Xuezhi, Wang, and Yuanchun Zhou

TL;DR
COMET is a novel transformer-based model that integrates diverse biological data and Metapaths to improve gene-disease association predictions, effectively capturing complex network semantics and dependencies.
Contribution
It introduces a comprehensive heterogeneous graph transformer leveraging multiple Metapaths and BioGPT features for enhanced gene-disease association prediction.
Findings
Outperforms state-of-the-art methods in accuracy and robustness.
Effectively captures long-distance dependencies in biological networks.
Provides interpretability through attention mechanisms.
Abstract
Discovering gene-disease associations is crucial for understanding disease mechanisms, yet identifying these associations remains challenging due to the time and cost of biological experiments. Computational methods are increasingly vital for efficient and scalable gene-disease association prediction. Graph-based learning models, which leverage node features and network relationships, are commonly employed for biomolecular predictions. However, existing methods often struggle to effectively integrate node features, heterogeneous structures, and semantic information. To address these challenges, we propose COmprehensive MEtapath-based heterogeneous graph Transformer(COMET) for predicting gene-disease associations. COMET integrates diverse datasets to construct comprehensive heterogeneous networks, initializing node features with BioGPT. We define seven Metapaths and utilize a transformer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Gene expression and cancer classification · Machine Learning in Bioinformatics
MethodsSoftmax · Attention Is All You Need
