Exploring Graph Learning Tasks with Pure LLMs: A Comprehensive Benchmark and Investigation

Yuxiang Wang; Xinnan Dai; Wenqi Fan; Yao Ma

arXiv:2502.18771·cs.LG·January 28, 2026

Exploring Graph Learning Tasks with Pure LLMs: A Comprehensive Benchmark and Investigation

Yuxiang Wang, Xinnan Dai, Wenqi Fan, Yao Ma

PDF

Open Access 1 Repo

TL;DR

This paper comprehensively evaluates large language models on graph learning tasks, revealing their strengths in few-shot learning, domain transfer, and robustness, and compares them to traditional graph models.

Contribution

It provides the first extensive benchmark of LLMs on graph tasks, analyzing their performance, data concerns, and potential beyond traditional methods.

Findings

01

LLMs outperform traditional models in few-shot scenarios

02

Instruction-tuned LLMs show strong domain transferability

03

LLMs demonstrate high robustness and generalization

Abstract

In recent years, large language models (LLMs) have emerged as promising candidates for graph tasks. Many studies leverage natural language to describe graphs and apply LLMs for reasoning, yet most focus narrowly on performance benchmarks without fully comparing LLMs to graph learning models or exploring their broader potential. In this work, we present a comprehensive study of LLMs on graph learning tasks, evaluating both off-the-shelf and instruction-tuned models across a variety of scenarios. Beyond accuracy, we discuss data leakage concerns and computational overhead, and assess their performance under few-shot/zero-shot settings, domain transfer, structural understanding, and robustness. Our findings show that LLMs, particularly those with instruction tuning, greatly outperform traditional graph learning models in few-shot settings, exhibit strong domain transferability, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

myflashbarry/LLM-benchmarking
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Multimodal Machine Learning Applications

MethodsFocus