TRAIL: Transferable Robust Adversarial Images via Latent diffusion

Yuhao Xue; Zhifei Zhang; Xinyang Jiang; Yifei Shen; Junyao Gao; Wentao Gu; Jiale Zhao; Miaojing Shi; Cairong Zhao

arXiv:2505.16166·cs.CV·May 23, 2025

TRAIL: Transferable Robust Adversarial Images via Latent diffusion

Yuhao Xue, Zhifei Zhang, Xinyang Jiang, Yifei Shen, Junyao Gao, Wentao Gu, Jiale Zhao, Miaojing Shi, Cairong Zhao

PDF

Open Access

TL;DR

TRAIL introduces a test-time adaptation framework using latent diffusion models to generate more transferable adversarial images, significantly improving black-box attack success rates by aligning adversarial features with natural image distributions.

Contribution

The paper proposes TRAIL, a novel method that adaptively updates diffusion models during attacks to produce more realistic and transferable adversarial images, addressing distribution mismatch issues.

Findings

01

TRAIL outperforms existing methods in cross-model transferability.

02

Distribution-aligned adversarial feature synthesis enhances black-box attack effectiveness.

03

Adaptive diffusion model updates improve the realism and transferability of adversarial samples.

Abstract

Adversarial attacks exploiting unrestricted natural perturbations present severe security risks to deep learning systems, yet their transferability across models remains limited due to distribution mismatches between generated adversarial features and real-world data. While recent works utilize pre-trained diffusion models as adversarial priors, they still encounter challenges due to the distribution shift between the distribution of ideal adversarial samples and the natural image distribution learned by the diffusion model. To address the challenge, we propose Transferable Robust Adversarial Images via Latent Diffusion (TRAIL), a test-time adaptation framework that enables the model to generate images from a distribution of images with adversarial features and closely resembles the target images. To mitigate the distribution shift, during attacks, TRAIL updates the diffusion U-Net's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques

MethodsDiffusion