Exploring the Robustness of Decision-Level Through Adversarial Attacks   on LLM-Based Embodied Models

Shuyuan Liu; Jiawei Chen; Shouwei Ruan; Hang Su; Zhaoxia Yin

arXiv:2405.19802·cs.MM·July 17, 2024

Exploring the Robustness of Decision-Level Through Adversarial Attacks on LLM-Based Embodied Models

Shuyuan Liu, Jiawei Chen, Shouwei Ruan, Hang Su, Zhaoxia Yin

PDF

Open Access

TL;DR

This paper introduces a new dataset and attack strategies to evaluate and demonstrate the vulnerability of LLM-based embodied models to adversarial prompt manipulations, revealing their limited robustness.

Contribution

The paper constructs the Embodied Intelligent Robot Attack Dataset (EIRAD) and develops novel attack methods, including evaluation techniques, to assess the robustness of LLM-based embodied models.

Findings

01

Higher attack success rates on LLM-based embodied models.

02

Lower decision-level robustness observed in these models.

03

Efficient prompt suffix initialization improves attack convergence.

Abstract

Embodied intelligence empowers agents with a profound sense of perception, enabling them to respond in a manner closely aligned with real-world situations. Large Language Models (LLMs) delve into language instructions with depth, serving a crucial role in generating plans for intricate tasks. Thus, LLM-based embodied models further enhance the agent's capacity to comprehend and process information. However, this amalgamation also ushers in new challenges in the pursuit of heightened intelligence. Specifically, attackers can manipulate LLMs to produce irrelevant or even malicious outputs by altering their prompts. Confronted with this challenge, we observe a notable absence of multi-modal datasets essential for comprehensively evaluating the robustness of LLM-based embodied models. Consequently, we construct the Embodied Intelligent Robot Attack Dataset (EIRAD), tailored specifically for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques