Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion

Chunlong Xie; Jialing He; Shangwei Guo; Jiacheng Wang; Shudong Zhang; Tianwei Zhang; Tao Xiang

arXiv:2505.23266·cs.CR·May 30, 2025

Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion

Chunlong Xie, Jialing He, Shangwei Guo, Jiacheng Wang, Shudong Zhang, Tianwei Zhang, Tao Xiang

PDF

Open Access

TL;DR

This paper introduces AdvOF, an adversarial attack framework that manipulates 3D objects to disrupt vision-language navigation agents, revealing vulnerabilities in service-oriented perception systems and emphasizing the need for robustness.

Contribution

The paper presents a novel adversarial object fusion method targeting VLM-based navigation, with a detailed optimization process and extensive evaluation demonstrating its effectiveness.

Findings

01

AdvOF significantly degrades agent navigation performance.

02

The attack maintains minimal interference with normal tasks.

03

The work highlights security vulnerabilities in VLM-powered navigation systems.

Abstract

We present Adversarial Object Fusion (AdvOF), a novel attack framework targeting vision-and-language navigation (VLN) agents in service-oriented environments by generating adversarial 3D objects. While foundational models like Large Language Models (LLMs) and Vision Language Models (VLMs) have enhanced service-oriented navigation systems through improved perception and decision-making, their integration introduces vulnerabilities in mission-critical service workflows. Existing adversarial attacks fail to address service computing contexts, where reliability and quality-of-service (QoS) are paramount. We utilize AdvOF to investigate and explore the impact of adversarial environments on the VLM-based perception module of VLN agents. In particular, AdvOF first precisely aggregates and aligns the victim object positions in both 2D and 3D space, defining and rendering adversarial objects.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

Methodstravel james