PROTEA: Securing Robot Task Planning and Execution

Zainab Altaweel; Mohaiminul Al Nahian; Jake Juettner; Adnan Siraj Rakin; Shiqi Zhang

arXiv:2601.07186·cs.RO·January 13, 2026

PROTEA: Securing Robot Task Planning and Execution

Zainab Altaweel, Mohaiminul Al Nahian, Jake Juettner, Adnan Siraj Rakin, Shiqi Zhang

PDF

Open Access

TL;DR

PROTEA introduces an LLM-based defense mechanism to evaluate and enhance the security of robot task planning against adversarial attacks, addressing safety assessment challenges in complex robotic systems.

Contribution

The paper presents PROTEA, a novel LLM-as-a-Judge framework for assessing and improving the security of robot task plans against malicious manipulations.

Findings

01

PROTEA effectively detects malicious task plans with high accuracy.

02

The dataset includes diverse benign and malicious plans with varying stealthiness.

03

Systematic evaluation shows PROTEA enhances robustness of robotic task planning.

Abstract

Robots need task planning methods to generate action sequences for complex tasks. Recent work on adversarial attacks has revealed significant vulnerabilities in existing robot task planners, especially those built on foundation models. In this paper, we aim to address these security challenges by introducing PROTEA, an LLM-as-a-Judge defense mechanism, to evaluate the security of task plans. PROTEA is developed to address the dimensionality and history challenges in plan safety assessment. We used different LLMs to implement multiple versions of PROTEA for comparison purposes. For systemic evaluations, we created a dataset containing both benign and malicious task plans, where the harmful behaviors were injected at varying levels of stealthiness. Our results provide actionable insights for robotic system practitioners seeking to enhance robustness and security of their task planning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Safety Systems Engineering in Autonomy · Security and Verification in Computing