Quantizing Text-attributed Graphs for Semantic-Structural Integration

Jianyuan Bo; Hao Wu; Yuan Fang

arXiv:2507.19526·cs.LG·July 29, 2025

Quantizing Text-attributed Graphs for Semantic-Structural Integration

Jianyuan Bo, Hao Wu, Yuan Fang

PDF

TL;DR

This paper introduces STAG, a self-supervised graph quantization framework that encodes structural information into discrete tokens, enabling effective zero-shot transfer learning with large language models for graph tasks.

Contribution

STAG presents a novel soft assignment and KL divergence guided quantization method for graph data, facilitating LLM integration without labeled data or manual graph verbalization.

Findings

01

Achieves state-of-the-art node classification results

02

Supports zero-shot transfer learning without labeled source data

03

Maintains compatibility across various LLM architectures

Abstract

Text-attributed graphs (TAGs) have emerged as a powerful representation for modeling complex relationships across diverse domains. With the rise of large language models (LLMs), there is growing interest in leveraging their capabilities for graph learning. However, current approaches face significant challenges in embedding structural information into LLM-compatible formats, requiring either computationally expensive alignment mechanisms or manual graph verbalization techniques that often lose critical structural details. Moreover, these methods typically require labeled data from source domains for effective transfer learning, significantly constraining their adaptability. We propose STAG, a novel self-supervised framework that directly quantizes graph structural information into discrete tokens using a frozen codebook. Unlike traditional quantization approaches, our method employs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.