From Prompt to Physical Action: Structured Backdoor Attacks on LLM-Mediated Robotic Control Systems

Mingyang Xie; Jin Wei-Kocsis

arXiv:2604.03890·cs.RO·April 7, 2026

From Prompt to Physical Action: Structured Backdoor Attacks on LLM-Mediated Robotic Control Systems

Mingyang Xie, Jin Wei-Kocsis

PDF

TL;DR

This paper uncovers vulnerabilities in LLM-based robotic control systems, demonstrating how structured backdoors can reliably trigger physical actions with high success rates and proposing a defense mechanism using semantic verification.

Contribution

It introduces supply-chain backdoor strategies in LLM-mediated robotic control, revealing their effectiveness and exploring a semantic verification defense to mitigate attacks.

Findings

01

Backdoors at command format levels are more effective than natural-language reasoning backdoors.

02

Backdoored models achieve 83% attack success rate with over 93% accuracy and low latency.

03

Semantic verification reduces attack success to 20% but increases response latency.

Abstract

The integration of large language models (LLMs) into robotic control pipelines enables natural language interfaces that translate user prompts into executable commands. However, this digital-to-physical interface introduces a critical and underexplored vulnerability: structured backdoor attacks embedded during fine-tuning. In this work, we experimentally investigate LoRA-based supply-chain backdoors in LLM-mediated ROS2 robotic control systems and evaluate their impact on physical robot execution. We construct two poisoned fine-tuning strategies targeting different stages of the command generation pipeline and reveal a key systems-level insight: back-doors embedded at the natural-language reasoning stage do not reliably propagate to executable control outputs, whereas backdoors aligned directly with structured JSON command formats successfully survive translation and trigger physical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.