DifAttack++: Query-Efficient Black-Box Adversarial Attack via Hierarchical Disentangled Feature Space in Cross-Domain

Jun Liu; Jiantao Zhou; Jiandian Zeng; Jinyu Tian; Isao Echizen

arXiv:2406.03017·cs.CV·March 25, 2026

DifAttack++: Query-Efficient Black-Box Adversarial Attack via Hierarchical Disentangled Feature Space in Cross-Domain

Jun Liu, Jiantao Zhou, Jiandian Zeng, Jinyu Tian, Isao Echizen

PDF

Open Access 1 Repo

TL;DR

DifAttack++ introduces a hierarchical disentangled feature space approach for query-efficient black-box adversarial attacks, significantly improving success rate and efficiency while maintaining visual quality by disentangling adversarial and visual features.

Contribution

The paper proposes a novel hierarchical disentangled feature space framework for black-box attacks, enabling targeted optimization of adversarial features with improved efficiency and success rate.

Findings

01

Achieves higher attack success rate than state-of-the-art methods.

02

Reduces query complexity for generating adversarial examples.

03

Maintains high visual quality of adversarial examples.

Abstract

This work investigates efficient score-based black-box adversarial attacks that achieve a high Attack Success Rate (ASR) and good generalization ability. We propose a novel attack framework, termed DifAttack++, which operates in a hierarchical disentangled feature space and significantly differs from existing methods that manipulate the entire feature space. Specifically, DifAttack++ firstly disentangles an image's latent representation into an Adversarial Feature (AF) and a Visual Feature (VF) using an autoencoder equipped with a carefully designed Hierarchical Decouple-Fusion (HDF) module. In this formulation, the AF primarily governs the adversarial capability of an image, while the VF largely preserves its visual appearance. To enable the feature disentanglement and image reconstruction, we jointly train two autoencoders for the clean and adversarial image domains, i.e.,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

csjunjun/difattack
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Network Security and Intrusion Detection

MethodsAutoencoders