OpenGraph: Towards Open Graph Foundation Models

Lianghao Xia; Ben Kao; Chao Huang

arXiv:2403.01121·cs.LG·October 10, 2024·1 cites

OpenGraph: Towards Open Graph Foundation Models

Lianghao Xia, Ben Kao, Chao Huang

PDF

Open Access 1 Repo

TL;DR

OpenGraph introduces a novel foundation model for graphs that leverages data augmentation with large language models, a unified graph tokenizer, and scalable transformers to enable effective zero-shot learning across diverse and unseen graph data.

Contribution

The paper presents a new graph foundation model, OpenGraph, that improves generalization to unseen graph data through innovative data augmentation, a unified tokenizer, and scalable transformer architecture.

Findings

01

Achieves state-of-the-art zero-shot performance on various graph tasks.

02

Effectively generalizes to unseen graph properties and datasets.

03

Demonstrates robustness across diverse graph domains.

Abstract

Graph learning has become essential in various domains, including recommendation systems and social network analysis. Graph Neural Networks (GNNs) have emerged as promising techniques for encoding structural information and improving performance in tasks like link prediction and node classification. However, a key challenge remains: the difficulty of generalizing to unseen graph data with different properties. In this work, we propose a novel graph foundation model, called OpenGraph, to address this challenge. Our approach tackles several technical obstacles. Firstly, we enhance data augmentation using a large language model (LLM) to overcome data scarcity in real-world scenarios. Secondly, we introduce a unified graph tokenizer that enables the model to generalize effectively to diverse graph data, even when encountering unseen properties during training. Thirdly, our developed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hkuds/opengraph
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel-Driven Software Engineering Techniques · Semantic Web and Ontologies · Distributed and Parallel Computing Systems

MethodsAttention Is All You Need · Laplacian EigenMap · Linear Layer · Layer Normalization · Byte Pair Encoding · Dropout · Multi-Head Attention · Laplacian Positional Encodings · Softmax · Dense Connections