GRAPHTEXTACK: A Realistic Black-Box Node Injection Attack on LLM-Enhanced GNNs
Jiaji Ma, Puja Trivedi, Danai Koutra

TL;DR
This paper introduces GRAPHTEXTACK, a novel black-box, multi-modal node injection attack targeting LLM-enhanced GNNs, demonstrating significant degradation of model performance without requiring internal model access.
Contribution
It presents the first realistic, black-box poisoning attack on LLM-GNNs using an evolutionary optimization framework for multi-modal node injection.
Findings
Outperforms 12 baseline attacks across five datasets
Effectively degrades LLM-GNN performance in a black-box setting
Demonstrates vulnerability of LLM-GNNs to multi-modal poisoning attacks
Abstract
Text-attributed graphs (TAGs), which combine structural and textual node information, are ubiquitous across many domains. Recent work integrates Large Language Models (LLMs) with Graph Neural Networks (GNNs) to jointly model semantics and structure, resulting in more general and expressive models that achieve state-of-the-art performance on TAG benchmarks. However, this integration introduces dual vulnerabilities: GNNs are sensitive to structural perturbations, while LLM-derived features are vulnerable to prompt injection and adversarial phrasing. While existing adversarial attacks largely perturb structure or text independently, we find that uni-modal attacks cause only modest degradation in LLM-enhanced GNNs. Moreover, many existing attacks assume unrealistic capabilities, such as white-box access or direct modification of graph data. To address these gaps, we propose GRAPHTEXTACK,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Graph Neural Networks · Adversarial Robustness in Machine Learning · Topic Modeling
