TL;DR
This survey reviews how Large Language Models can address key data challenges in graph learning, such as incompleteness, imbalance, heterogeneity, and dynamic changes, highlighting recent advances and future research directions.
Contribution
It provides a comprehensive overview of LLM-driven solutions for fundamental graph data challenges, comparing traditional and modern approaches.
Findings
LLMs enhance semantic reasoning in graph tasks
Addressing data incompleteness with LLMs improves accuracy
Future research should explore LLM integration for dynamic graphs
Abstract
Graphs are a widely used paradigm for representing non-Euclidean data, with applications ranging from social network analysis to biomolecular prediction. While graph learning has achieved remarkable progress, real-world graph data presents a number of challenges that significantly hinder the learning process. In this survey, we focus on four fundamental data-centric challenges: (1) Incompleteness, real-world graphs have missing nodes, edges, or attributes; (2) Imbalance, the distribution of the labels of nodes or edges and their structures for real-world graphs are highly skewed; (3) Cross-domain Heterogeneity, graphs from different domains exhibit incompatible feature spaces or structural patterns; and (4) Dynamic Instability, graphs evolve over time in unpredictable ways. Recently, Large Language Models (LLMs) offer the potential to tackle these challenges by leveraging rich semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
