Reasoning-Style Poisoning of LLM Agents via Stealthy Style Transfer: Process-Level Attacks and Runtime Monitoring in RSV Space

Xingfu Zhou; Pengfei Wang

arXiv:2512.14448·cs.CR·December 17, 2025

Reasoning-Style Poisoning of LLM Agents via Stealthy Style Transfer: Process-Level Attacks and Runtime Monitoring in RSV Space

Xingfu Zhou, Pengfei Wang

PDF

Open Access

TL;DR

This paper uncovers a novel attack on LLM agents that manipulates their reasoning style through stealthy document rewriting, significantly degrading performance and bypassing filters, and proposes a real-time monitoring defense mechanism.

Contribution

It introduces Reasoning-Style Poisoning (RSP), a new process-oriented attack on LLMs, and develops RSV metrics and a runtime monitor for detection and defense.

Findings

01

GSI significantly degrades LLM reasoning performance.

02

GSI can bypass content filters effectively.

03

RSV metrics enable real-time detection of reasoning style manipulation.

Abstract

Large Language Model (LLM) agents relying on external retrieval are increasingly deployed in high-stakes environments. While existing adversarial attacks primarily focus on content falsification or instruction injection, we identify a novel, process-oriented attack surface: the agent's reasoning style. We propose Reasoning-Style Poisoning (RSP), a paradigm that manipulates how agents process information rather than what they process. We introduce Generative Style Injection (GSI), an attack method that rewrites retrieved documents into pathological tones--specifically "analysis paralysis" or "cognitive haste"--without altering underlying facts or using explicit triggers. To quantify these shifts, we develop the Reasoning Style Vector (RSV), a metric tracking Verification depth, Self-confidence, and Attention focus. Experiments on HotpotQA and FEVER using ReAct, Reflection, and Tree of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Machine Learning in Materials Science