Reasoning-Style Poisoning of LLM Agents via Stealthy Style Transfer: Process-Level Attacks and Runtime Monitoring in RSV Space
Xingfu Zhou, Pengfei Wang

TL;DR
This paper uncovers a novel attack on LLM agents that manipulates their reasoning style through stealthy document rewriting, significantly degrading performance and bypassing filters, and proposes a real-time monitoring defense mechanism.
Contribution
It introduces Reasoning-Style Poisoning (RSP), a new process-oriented attack on LLMs, and develops RSV metrics and a runtime monitor for detection and defense.
Findings
GSI significantly degrades LLM reasoning performance.
GSI can bypass content filters effectively.
RSV metrics enable real-time detection of reasoning style manipulation.
Abstract
Large Language Model (LLM) agents relying on external retrieval are increasingly deployed in high-stakes environments. While existing adversarial attacks primarily focus on content falsification or instruction injection, we identify a novel, process-oriented attack surface: the agent's reasoning style. We propose Reasoning-Style Poisoning (RSP), a paradigm that manipulates how agents process information rather than what they process. We introduce Generative Style Injection (GSI), an attack method that rewrites retrieved documents into pathological tones--specifically "analysis paralysis" or "cognitive haste"--without altering underlying facts or using explicit triggers. To quantify these shifts, we develop the Reasoning Style Vector (RSV), a metric tracking Verification depth, Self-confidence, and Attention focus. Experiments on HotpotQA and FEVER using ReAct, Reflection, and Tree of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Machine Learning in Materials Science
