Text-space Graph Foundation Models: Comprehensive Benchmarks and New Insights
Zhikai Chen, Haitao Mao, Jingzhe Liu, Yu Song, Bingheng Li, Wei Jin,, Bahare Fatemi, Anton Tsitsulin, Bryan Perozzi, Hui Liu, Jiliang Tang

TL;DR
This paper introduces a comprehensive benchmark for text-space Graph Foundation Models, providing new datasets and evaluation protocols to better understand their effectiveness across diverse graph tasks.
Contribution
It presents the first unified benchmark with novel datasets and evaluation settings for text-space GFMs, enabling fair comparison and deeper insights.
Findings
New insights into GFM effectiveness across tasks
Identification of key challenges in current models
Benchmark datasets facilitate future research
Abstract
Given the ubiquity of graph data and its applications in diverse domains, building a Graph Foundation Model (GFM) that can work well across different graphs and tasks with a unified backbone has recently garnered significant interests. A major obstacle to achieving this goal stems from the fact that graphs from different domains often exhibit diverse node features. Inspired by multi-modal models that align different modalities with natural language, the text has recently been adopted to provide a unified feature space for diverse graphs. Despite the great potential of these text-space GFMs, current research in this field is hampered by two problems. First, the absence of a comprehensive benchmark with unified problem settings hinders a clear understanding of the comparative effectiveness and practical value of different text-space GFMs. Second, there is a lack of sufficient datasets to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDistributed and Parallel Computing Systems · DNA and Biological Computing · Cellular Automata and Applications
MethodsALIGN
