EPGAT: Gene Essentiality Prediction With Graph Attention Networks
Jo\~ao Schapke, Anderson Tavares, Mariana Recamonde-Mendoza

TL;DR
EPGAT employs Graph Attention Networks to accurately predict gene essentiality from PPI networks and multiomics data, outperforming traditional methods and demonstrating robustness with limited data across multiple organisms.
Contribution
The paper introduces EPGAT, a novel GAT-based model that directly learns from graph-structured biological data for essential gene prediction, surpassing existing methods.
Findings
Achieved AUC scores from 0.78 to 0.97 across four organisms.
Outperformed network-based and shallow ML methods.
Showed robustness with limited and imbalanced training data.
Abstract
The identification of essential genes/proteins is a critical step towards a better understanding of human biology and pathology. Computational approaches helped to mitigate experimental constraints by exploring machine learning (ML) methods and the correlation of essentiality with biological information, especially protein-protein interaction (PPI) networks, to predict essential genes. Nonetheless, their performance is still limited, as network-based centralities are not exclusive proxies of essentiality, and traditional ML methods are unable to learn from non-Euclidean domains such as graphs. Given these limitations, we proposed EPGAT, an approach for essentiality prediction based on Graph Attention Networks (GATs), which are attention-based Graph Neural Networks (GNNs) that operate on graph-structured data. Our model directly learns patterns of gene essentiality from PPI networks,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodsnode2vec
