Rethinking Cancer Gene Identification through Graph Anomaly Analysis
Yilong Zang, Lingfei Ren, Yue Li, Zhikang Wang, David Antony Selby, Zheng Wang, Sebastian Josef Vollmer, Hongzhi Yin, Jiangning Song, Junhang Wu

TL;DR
This paper introduces HIPGNN, a novel graph neural network that captures biological anomalies like weight heterogeneity in protein interaction networks to improve cancer gene identification.
Contribution
It pioneers the modeling of biological anomalies as graph anomalies and develops HIPGNN to better analyze complex protein interaction patterns for cancer genes.
Findings
HIPGNN outperforms existing methods on STRINGdb and CPDB datasets.
Weight heterogeneity is a distinctive anomaly in cancer gene graphs.
Spectral analysis reveals energy concentration shifts due to anomalies.
Abstract
Graph neural networks (GNNs) have shown promise in integrating protein-protein interaction (PPI) networks for identifying cancer genes in recent studies. However, due to the insufficient modeling of the biological information in PPI networks, more faithfully depiction of complex protein interaction patterns for cancer genes within the graph structure remains largely unexplored. This study takes a pioneering step toward bridging biological anomalies in protein interactions caused by cancer genes to statistical graph anomaly. We find a unique graph anomaly exhibited by cancer genes, namely weight heterogeneity, which manifests as significantly higher variance in edge weights of cancer gene nodes within the graph. Additionally, from the spectral perspective, we demonstrate that the weight heterogeneity could lead to the "flattening out" of spectral energy, with a concentration towards the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsBioinformatics and Genomic Networks · Gene expression and cancer classification · Machine Learning in Bioinformatics
MethodsGraph Neural Network
