Membership Inference Attacks on Vision-Language-Action Models
Yuefeng Peng, Mingzhe Li, Kejing Xia, Renhao Zhang, Amir Houmansadr

TL;DR
This paper systematically studies membership inference attacks on vision-language-action models, revealing significant privacy vulnerabilities especially in black-box scenarios, and highlights the need for dedicated defenses.
Contribution
First formalization and evaluation of MIAs on VLA models, demonstrating their high vulnerability and introducing attack methods under various access regimes.
Findings
Black-box attacks on VLA models are highly effective.
VLA models are more vulnerable due to their structured action outputs.
Membership inference poses a significant privacy risk for embodied AI systems.
Abstract
Membership inference attacks (MIAs) have been extensively studied in large language models (LLMs) and vision-language models (VLMs), yet their implications for vision-language-action (VLA) models remain largely unexplored. VLA models differ from standard LLMs and VLMs in several important ways: they are often fine-tuned for many epochs on relatively small embodied datasets, operate over constrained and structured action spaces, and expose action outputs that can be observed as executable behaviors and temporally correlated trajectories. These characteristics suggest a distinct and potentially more informative attack surface for membership inference. In this work, we present the first systematic study of MIAs against VLA systems. We formalize two membership inference settings for VLA models: sample-level inference over individual transition samples and trajectory-level inference over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
