UniGLM: Training One Unified Language Model for Text-Attributed Graph   Embedding

Yi Fang; Dongzhe Fan; Sirui Ding; Ninghao Liu; Qiaoyu Tan

arXiv:2406.12052·cs.CL·December 24, 2024·2 cites

UniGLM: Training One Unified Language Model for Text-Attributed Graph Embedding

Yi Fang, Dongzhe Fan, Sirui Ding, Ninghao Liu, Qiaoyu Tan

PDF

Open Access 1 Repo

TL;DR

UniGLM is a novel unified language model trained on multiple text-attributed graphs using contrastive learning, enabling effective generalization and transfer learning across diverse graph scenarios.

Contribution

This paper introduces UniGLM, the first graph embedding model that generalizes across in-domain and cross-domain TAGs through multi-graph training and contrastive learning.

Findings

01

Outperforms existing baselines on 9 benchmark TAGs.

02

Demonstrates strong generalization across various downstream tasks.

03

Effective in both in-domain and out-of-domain transfer scenarios.

Abstract

Representation learning on text-attributed graphs (TAGs), where nodes are represented by textual descriptions, is crucial for textual and relational knowledge systems and recommendation systems. Currently, state-of-the-art embedding methods for TAGs primarily focus on fine-tuning language models (e.g., BERT) using structure-aware training signals. While effective, these methods are tailored for individual TAG and cannot generalize across various graph scenarios. Given the shared textual space, leveraging multiple TAGs for joint fine-tuning, aligning text and graph structure from different aspects, would be more beneficial. Motivated by this, we introduce a novel Unified Graph Language Model (UniGLM) framework, the first graph embedding model that generalizes well to both in-domain and cross-domain TAGs. Specifically, UniGLM is trained over multiple TAGs with different domains and scales…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nyushcs/uniglm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Semantic Web and Ontologies

MethodsFocus