AttDiff-GAN: A Hybrid Diffusion-GAN Framework for Facial Attribute Editing

Wenmin Huang; Weiqi Luo; Xiaochun Cao; and Jiwu Huang

arXiv:2604.21289·cs.CV·April 24, 2026

AttDiff-GAN: A Hybrid Diffusion-GAN Framework for Facial Attribute Editing

Wenmin Huang, Weiqi Luo, Xiaochun Cao, and Jiwu Huang

PDF

TL;DR

AttDiff-GAN introduces a hybrid diffusion-GAN framework for facial attribute editing that improves accuracy and attribute preservation by combining feature-level adversarial learning with diffusion guidance.

Contribution

The paper proposes a novel hybrid framework that decouples attribute editing from image synthesis, integrating facial priors and semantic relationships for enhanced editing precision.

Findings

01

Achieves more accurate facial attribute editing on CelebA-HQ.

02

Better preservation of non-target attributes compared to state-of-the-art methods.

03

Demonstrates superior qualitative and quantitative results in experiments.

Abstract

Facial attribute editing aims to modify target attributes while preserving attribute-irrelevant content and overall image fidelity. Existing GAN-based methods provide favorable controllability, but often suffer from weak alignment between style codes and attribute semantics. Diffusion-based methods can synthesize highly realistic images; however, their editing precision is limited by the entanglement of semantic directions among different attributes. In this paper, we propose AttDiff-GAN, a hybrid framework that combines GAN-based attribute manipulation with diffusion-based image generation. A key challenge in such integration lies in the inconsistency between one-step adversarial learning and multi-step diffusion denoising, which makes effective optimization difficult. To address this issue, we decouple attribute editing from image synthesis by introducing a feature-level adversarial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.