A Few Words Can Distort Graphs: Knowledge Poisoning Attacks on Graph-based Retrieval-Augmented Generation of Large Language Models
Jiayi Wen, Tianxin Chen, Zhirun Zheng, Cheng Huang

TL;DR
This paper reveals how minimal text modifications can maliciously distort knowledge graphs in GraphRAG, severely misleading LLM-based reasoning, and demonstrates the effectiveness of two novel knowledge poisoning attacks.
Contribution
It introduces two new knowledge poisoning attacks on GraphRAG, showing how small changes can significantly impact graph integrity and downstream tasks.
Findings
Targeted KPA achieves 93.1% success in controlling QA outcomes.
Universal KPA reduces QA accuracy from 95% to 50% with less than 0.05% text modification.
Current defenses fail to detect these poisoning attacks.
Abstract
Graph-based Retrieval-Augmented Generation (GraphRAG) has recently emerged as a promising paradigm for enhancing large language models (LLMs) by converting raw text into structured knowledge graphs, improving both accuracy and explainability. However, GraphRAG relies on LLMs to extract knowledge from raw text during graph construction, and this process can be maliciously manipulated to implant misleading information. Targeting this attack surface, we propose two knowledge poisoning attacks (KPAs) and demonstrate that modifying only a few words in the source text can significantly change the constructed graph, poison the GraphRAG, and severely mislead downstream reasoning. The first attack, named Targeted KPA (TKPA), utilizes graph-theoretic analysis to locate vulnerable nodes in the generated graphs and rewrites the corresponding narratives with LLMs, achieving precise control over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Multimodal Machine Learning Applications
