Model Generalization on Text Attribute Graphs: Principles with Large Language Models

Haoyu Wang; Shikun Liu; Rongzhe Wei; Pan Li

arXiv:2502.11836·cs.LG·June 10, 2025

Model Generalization on Text Attribute Graphs: Principles with Large Language Models

Haoyu Wang, Shikun Liu, Rongzhe Wei, Pan Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces LLM-BP, a framework leveraging large language models for better generalization on text-attributed graphs by unifying attribute spaces and developing adaptive information aggregation, showing significant improvements on real-world benchmarks.

Contribution

The paper proposes a novel framework, LLM-BP, that enhances generalization on text-attributed graphs using LLM-based encoders, task-aware prompting, and belief propagation with adaptive parameters.

Findings

01

LLM-BP outperforms existing methods on 11 real-world benchmarks.

02

Achieves 8.10% improvement with task-conditional embeddings.

03

Gains an additional 1.71% from adaptive aggregation.

Abstract

Large language models (LLMs) have recently been introduced to graph learning, aiming to extend their zero-shot generalization success to tasks where labeled graph data is scarce. Among these applications, inference over text-attributed graphs (TAGs) presents unique challenges: existing methods struggle with LLMs' limited context length for processing large node neighborhoods and the misalignment between node embeddings and the LLM token space. To address these issues, we establish two key principles for ensuring generalization and derive the framework LLM-BP accordingly: (1) Unifying the attribute space with task-adaptive embeddings, where we leverage LLM-based encoders and task-aware prompting to enhance generalization of the text attribute embeddings; (2) Developing a generalizable graph information aggregation mechanism, for which we adopt belief propagation with LLM-estimated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

graph-com/llm_bp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsADaptive gradient method with the OPTimal convergence rate