NeuroGenPoisoning: Neuron-Guided Attacks on Retrieval-Augmented Generation of LLM via Genetic Optimization of External Knowledge

Hanyu Zhu; Lance Fiondella; Jiawei Yuan; Kai Zeng; Long Jiao

arXiv:2510.21144·cs.AI·January 13, 2026

NeuroGenPoisoning: Neuron-Guided Attacks on Retrieval-Augmented Generation of LLM via Genetic Optimization of External Knowledge

Hanyu Zhu, Lance Fiondella, Jiawei Yuan, Kai Zeng, Long Jiao

PDF

Open Access

TL;DR

NeuroGenPoisoning introduces a neuron-guided genetic optimization attack on RAG systems, effectively injecting poisoned knowledge with high success rates while maintaining fluency and resolving knowledge conflicts.

Contribution

This work presents the first neuron-level, genetic algorithm-based poisoning framework for RAG, leveraging internal neuron attribution to generate highly effective adversarial knowledge.

Findings

01

Achieves over 90% success rate in poisoning RAG models

02

Effectively resolves knowledge conflicts in adversarial knowledge

03

Maintains fluency of generated adversarial passages

Abstract

Retrieval-Augmented Generation (RAG) empowers Large Language Models (LLMs) to dynamically integrate external knowledge during inference, improving their factual accuracy and adaptability. However, adversaries can inject poisoned external knowledge to override the model's internal memory. While existing attacks iteratively manipulate retrieval content or prompt structure of RAG, they largely ignore the model's internal representation dynamics and neuron-level sensitivities. The underlying mechanism of RAG poisoning has not been fully studied and the effect of knowledge conflict with strong parametric knowledge in RAG is not considered. In this work, we propose NeuroGenPoisoning, a novel attack framework that generates adversarial external knowledge in RAG guided by LLM internal neuron attribution and genetic optimization. Our method first identifies a set of Poison-Responsive Neurons…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Advanced Graph Neural Networks