Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs

Zihui Chen; Yuling Wang; Pengfei Jiao; Kai Wu; Xiao Wang; Xiang Ao; Dalin Zhang

arXiv:2603.21155·cs.AI·March 24, 2026

Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs

Zihui Chen, Yuling Wang, Pengfei Jiao, Kai Wu, Xiao Wang, Xiang Ao, Dalin Zhang

PDF

Open Access

TL;DR

This paper introduces BadGraph, a universal adversarial attack framework that exploits large language models to perturb both the structure and text of graph data, revealing vulnerabilities in text-attributed graph models across different architectures.

Contribution

The paper proposes BadGraph, a novel LLM-based attack method that effectively attacks diverse TAG models by perturbing topology and semantics, demonstrating universal vulnerabilities.

Findings

01

Achieves up to 76.3% performance drop in attacked models.

02

Effective across GNN and LLM-based graph reasoners.

03

Stealthy and interpretable attack mechanism.

Abstract

Text-attributed graphs (TAGs) enhance graph learning by integrating rich textual semantics and topological context for each node. While boosting expressiveness, they also expose new vulnerabilities in graph learning through text-based adversarial surfaces. Recent advances leverage diverse backbones, such as graph neural networks (GNNs) and pre-trained language models (PLMs), to capture both structural and textual information in TAGs. This diversity raises a key question: How can we design universal adversarial attacks that generalize across architectures to assess the security of TAG models? The challenge arises from the stark contrast in how different backbones-GNNs and PLMs-perceive and encode graph patterns, coupled with the fact that many PLMs are only accessible via APIs, limiting attacks to black-box settings. To address this, we propose BadGraph, a novel attack framework that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Adversarial Robustness in Machine Learning · Topic Modeling