TL;DR
This paper introduces a novel security attack on multi-robot systems controlled by LLMs, where compromising a single robot can propagate malicious commands, leading to unsafe coordinated actions, highlighting significant safety risks.
Contribution
The study presents a new attack paradigm demonstrating how malicious control of one robot can spread through communication, exposing vulnerabilities in multi-robot LLM-based coordination.
Findings
Obedience reaches 1.00 in strong attack cases
Infectiousness rises to 0.90, indicating rapid propagation
Compromise of all robots in as few as 3 rounds
Abstract
Large language models (LLMs) are increasingly used as general planners in embodied intelligence, enabling high level coordination and low level task planning for both single robot and multi-robot collaboration. This increasing reliance on embodied LLM planners also raises critical security concerns, since misaligned or manipulated instructions can be translated into physical actions. Prior work has studied such threats in single robot settings, while security risks in LLM controlled multi-robot collaboration, especially those propagated through inter robot communication, remain largely unexplored. To bridge this gap, we propose a novel attack paradigm for multi-robot system in which the adversary interacts with only a single entry robot. The compromised robot then propagates malicious intent through peer communication, leading to coordinated unsafe actions across the system. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
