Text-space Graph Foundation Models: Comprehensive Benchmarks and New Insights

Zhikai Chen; Haitao Mao; Jingzhe Liu; Yu Song; Bingheng Li; Wei Jin; Bahare Fatemi; Anton Tsitsulin; Bryan Perozzi; Hui Liu; Jiliang Tang

arXiv:2406.10727 · cs.LG · June 18, 2024

TL;DR

This paper introduces a comprehensive benchmark for text-space Graph Foundation Models, providing new datasets and evaluation protocols to better understand their effectiveness across diverse graph tasks.

Contribution

The paper presents the first unified benchmark for text-space GFMs, with novel datasets and evaluation settings that enable fair comparison and deeper insights.

Findings

01 New insights into GFM effectiveness across tasks

02 Identification of key challenges in current models

03 Benchmark datasets facilitate future research

Abstract

Given the ubiquity of graph data and its applications in diverse domains, building a Graph Foundation Model (GFM) that can work well across different graphs and tasks with a unified backbone has recently garnered significant interest. A major obstacle to achieving this goal is that graphs from different domains often exhibit diverse node features. Inspired by multi-modal models that align different modalities with natural language, text has recently been adopted to provide a unified feature space for diverse graphs. Despite the great potential of these text-space GFMs, current research in this field is hampered by two problems. First, the absence of a comprehensive benchmark with unified problem settings hinders a clear understanding of the comparative effectiveness and practical value of different text-space GFMs. Second, there is a lack of sufficient datasets to…
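
The idea of a text-based unified feature space can be illustrated with a small sketch. It is not the paper's method or the benchmark's code: the hashed bag-of-words encoder below is a toy stand-in for a language-model encoder, and the two graphs, the `DIM` value, and the function names are hypothetical. It only shows that once every node is described in text, nodes from unrelated graphs end up as vectors of the same width.

```python
import hashlib
import math

DIM = 16  # hypothetical width of the shared feature space


def encode_text(text, dim=DIM):
    """Toy stand-in for a language-model encoder (assumption, not the paper's encoder).

    Hashes each whitespace token into one of `dim` buckets, counts hits,
    and L2-normalises the result.
    """
    vec = [0.0] * dim
    for token in text.lower().split():
        digest = hashlib.md5(token.encode("utf-8")).hexdigest()
        vec[int(digest, 16) % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else vec


def featurize(node_texts):
    """Map {node_id: text description} to {node_id: feature vector}."""
    return {node: encode_text(desc) for node, desc in node_texts.items()}


# Two hypothetical graphs from different domains; only node texts are shown.
citation_nodes = {
    0: "paper about graph neural networks",
    1: "paper about protein structure prediction",
}
product_nodes = {
    0: "wireless noise cancelling headphones",
    1: "stainless steel water bottle",
    2: "mechanical keyboard with backlight",
}

citation_feats = featurize(citation_nodes)
product_feats = featurize(product_nodes)

# Both graphs now share one feature dimensionality, so a single
# downstream model could in principle consume either of them.
print(len(citation_feats[0]), len(product_feats[0]))
```

In practice the toy encoder would be replaced by a pretrained text encoder; the point of the sketch is only the shared output shape across graphs.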

Code & Models

Repositories

currytang/tsgfm
PyTorch · Official

Videos

Text-space Graph Foundation Models: Comprehensive Benchmarks and New Insights · SlidesLive

Taxonomy

Topics: Distributed and Parallel Computing Systems · DNA and Biological Computing · Cellular Automata and Applications

Methods: ALIGN