CtrlAttack: A Unified Attack on World-Model Control in Diffusion Models
Shuhan Xu, Siyuan Liang, Hongling Zheng, Yong Luo, Han Hu, Lefei Zhang, Dacheng Tao

TL;DR
This paper introduces CtrlAttack, a novel method to disrupt the temporal dynamics of diffusion-based image-to-video models, exposing their vulnerability and highlighting security concerns in their state transition mechanisms.
Contribution
We propose a trajectory-control attack that interferes with state evolution in I2V models, revealing their susceptibility to low-dimensional perturbations across attack settings.
Findings
Achieves over 90% attack success rate in white-box settings.
Over 80% success rate in black-box settings.
Maintains low impact on visual quality metrics.
Abstract
Diffusion-based image-to-video (I2V) models increasingly exhibit world-model-like properties by implicitly capturing temporal dynamics. However, existing studies have mainly focused on visual quality and controllability, and the robustness of the state transition learned by the model remains understudied. To fill this gap, we are the first to analyze the vulnerability of I2V models, find that temporal control mechanisms constitute a new attack surface, and reveal the challenge of modeling them uniformly under different attack settings. Based on this, we propose a trajectory-control attack, called CtrlAttack, to interfere with state evolution during the generation process. Specifically, we represent the perturbation as a low-dimensional velocity field and construct a continuous displacement field via temporal integration, thereby affecting the model's state transitions while maintaining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Smart Grid Security and Resilience
