OpenGraph: Towards Open Graph Foundation Models
Lianghao Xia, Ben Kao, Chao Huang

TL;DR
OpenGraph introduces a novel foundation model for graphs that leverages data augmentation with large language models, a unified graph tokenizer, and scalable transformers to enable effective zero-shot learning across diverse and unseen graph data.
Contribution
The paper presents a new graph foundation model, OpenGraph, that improves generalization to unseen graph data through innovative data augmentation, a unified tokenizer, and scalable transformer architecture.
Findings
Achieves state-of-the-art zero-shot performance on various graph tasks.
Effectively generalizes to unseen graph properties and datasets.
Demonstrates robustness across diverse graph domains.
Abstract
Graph learning has become essential in various domains, including recommendation systems and social network analysis. Graph Neural Networks (GNNs) have emerged as promising techniques for encoding structural information and improving performance in tasks like link prediction and node classification. However, a key challenge remains: the difficulty of generalizing to unseen graph data with different properties. In this work, we propose a novel graph foundation model, called OpenGraph, to address this challenge. Our approach tackles several technical obstacles. Firstly, we enhance data augmentation using a large language model (LLM) to overcome data scarcity in real-world scenarios. Secondly, we introduce a unified graph tokenizer that enables the model to generalize effectively to diverse graph data, even when encountering unseen properties during training. Thirdly, our developed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel-Driven Software Engineering Techniques · Semantic Web and Ontologies · Distributed and Parallel Computing Systems
MethodsAttention Is All You Need · Laplacian EigenMap · Linear Layer · Layer Normalization · Byte Pair Encoding · Dropout · Multi-Head Attention · Laplacian Positional Encodings · Softmax · Dense Connections
