A Survey of Large Language Models for Data Challenges in Graphs

Mengran Li; Pengyu Zhang; Wenbin Xing; Yijia Zheng; Klim Zaporojets; Junzhou Chen; Ronghui Zhang; Yong Zhang; Siyuan Gong; Jia Hu; Xiaolei Ma; Zhiyuan Liu; Paul Groth; Marcel Worring

arXiv:2505.18475·cs.LG·September 22, 2025

A Survey of Large Language Models for Data Challenges in Graphs

Mengran Li, Pengyu Zhang, Wenbin Xing, Yijia Zheng, Klim Zaporojets, Junzhou Chen, Ronghui Zhang, Yong Zhang, Siyuan Gong, Jia Hu, Xiaolei Ma, Zhiyuan Liu, Paul Groth, Marcel Worring

PDF

1 Repo

TL;DR

This survey reviews how Large Language Models can address key data challenges in graph learning, such as incompleteness, imbalance, heterogeneity, and dynamic changes, highlighting recent advances and future research directions.

Contribution

It provides a comprehensive overview of LLM-driven solutions for fundamental graph data challenges, comparing traditional and modern approaches.

Findings

01

LLMs enhance semantic reasoning in graph tasks

02

Addressing data incompleteness with LLMs improves accuracy

03

Future research should explore LLM integration for dynamic graphs

Abstract

Graphs are a widely used paradigm for representing non-Euclidean data, with applications ranging from social network analysis to biomolecular prediction. While graph learning has achieved remarkable progress, real-world graph data presents a number of challenges that significantly hinder the learning process. In this survey, we focus on four fundamental data-centric challenges: (1) Incompleteness, real-world graphs have missing nodes, edges, or attributes; (2) Imbalance, the distribution of the labels of nodes or edges and their structures for real-world graphs are highly skewed; (3) Cross-domain Heterogeneity, graphs from different domains exhibit incompatible feature spaces or structural patterns; and (4) Dynamic Instability, graphs evolve over time in unpredictable ways. Recently, Large Language Models (LLMs) offer the potential to tackle these challenges by leveraging rich semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

limengran98/awesome-literature-graph-learning-challenges
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.