TL;DR
OgBench is a benchmarking framework designed to evaluate graph neural networks on omics data, focusing on the low-sample, high-node regime typical in biological applications, revealing that GNNs often do not outperform simpler models.
Contribution
This work introduces OgBench, the first standardized platform for benchmarking GNNs on omics data in the challenging $n \\ll p$ regime, and provides insights into their limited advantages.
Findings
Widely used GNNs often do not outperform MLPs and classical baselines.
The structure of biological graphs may not inherently improve predictive performance.
OgBench enables development and validation of models tailored for biological graph data.
Abstract
Graph Neural Networks (GNNs) have become the dominant framework for inductive graph-level learning. Yet most benchmarks focus on the regime , where the number of graphs greatly exceeds the number of nodes per graph . This overlooks biological domains such as omics, which operate in the opposite regime, characterized by large graphs of genes, transcripts, or proteins across few patient samples. This raises the question: \textit{how do GNNs perform in this low-sample, high-node omics setting?} We introduce \texttt{OgBench} (Omics-Graph Bench), the first benchmarking platform for graph-level prediction in the regime characteristic of omics data. We provide a standardized, end-to-end modular infrastructure from raw omics data to families of featured graphs with varied structural properties. We benchmark classical GNNs, as well as GNNs designed for large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
