TRAIL: Transferable Robust Adversarial Images via Latent diffusion
Yuhao Xue, Zhifei Zhang, Xinyang Jiang, Yifei Shen, Junyao Gao, Wentao Gu, Jiale Zhao, Miaojing Shi, Cairong Zhao

TL;DR
TRAIL introduces a test-time adaptation framework using latent diffusion models to generate more transferable adversarial images, significantly improving black-box attack success rates by aligning adversarial features with natural image distributions.
Contribution
The paper proposes TRAIL, a novel method that adaptively updates diffusion models during attacks to produce more realistic and transferable adversarial images, addressing distribution mismatch issues.
Findings
TRAIL outperforms existing methods in cross-model transferability.
Distribution-aligned adversarial feature synthesis enhances black-box attack effectiveness.
Adaptive diffusion model updates improve the realism and transferability of adversarial samples.
Abstract
Adversarial attacks exploiting unrestricted natural perturbations present severe security risks to deep learning systems, yet their transferability across models remains limited due to distribution mismatches between generated adversarial features and real-world data. While recent works utilize pre-trained diffusion models as adversarial priors, they still encounter challenges due to the distribution shift between the distribution of ideal adversarial samples and the natural image distribution learned by the diffusion model. To address the challenge, we propose Transferable Robust Adversarial Images via Latent Diffusion (TRAIL), a test-time adaptation framework that enables the model to generate images from a distribution of images with adversarial features and closely resembles the target images. To mitigate the distribution shift, during attacks, TRAIL updates the diffusion U-Net's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques
MethodsDiffusion
