ATAAT: Adaptive Threat-Aware Adversarial Tuning Framework against Backdoor Attacks on Vision-Language-Action Models

Kewei Chen; Yayu Long; Shuai Li; Mingsheng Shang

arXiv:2605.08612·cs.RO·May 12, 2026

ATAAT: Adaptive Threat-Aware Adversarial Tuning Framework against Backdoor Attacks on Vision-Language-Action Models

Kewei Chen, Yayu Long, Shuai Li, Mingsheng Shang

PDF

TL;DR

This paper introduces ATAAT, a novel framework that enhances the robustness of vision-language-action models against backdoor attacks by intelligently adapting attack strategies, demonstrating high success rates and stealthiness.

Contribution

The paper proposes ATAAT with a Threat-Method Adaptive Mapping mechanism, addressing gradient interference and enabling effective, stealthy backdoor attacks on VLAs.

Findings

01

ATAAT achieves over 80% Targeted Attack Success Rate.

02

It maintains high stealthiness with only 5% poisoning.

03

First to demonstrate implicit decoupled attacks in data poisoning.

Abstract

Addressing the escalating security vulnerabilities in Vision-Language-Action (VLA) models, this study investigates backdoor attacks targeting the visual pathway. We identify a core obstacle causing the failure of traditional attack paradigms: "Gradient Interference." This phenomenon represents an optimization failure triggered by conflicting strategies during end-to-end training. To resolve this, we propose an Adaptive Threat-Aware Adversarial Tuning (ATAAT) framework. Through its core "Threat-Method Adaptive Mapping" mechanism, ATAAT intelligently selects the optimal gradient decoupling strategy based on the adversary's capabilities. Extensive experiments demonstrate that ATAAT exhibits significant advantages, achieving a highly robust Targeted Attack Success Rate (TASR > 80%) while maintaining extreme stealthiness with merely a 5% poisoning rate. It efficiently handles complex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.