AttDiff-GAN: A Hybrid Diffusion-GAN Framework for Facial Attribute Editing
Wenmin Huang, Weiqi Luo, Xiaochun Cao, and Jiwu Huang

TL;DR
AttDiff-GAN introduces a hybrid diffusion-GAN framework for facial attribute editing that improves accuracy and attribute preservation by combining feature-level adversarial learning with diffusion guidance.
Contribution
The paper proposes a novel hybrid framework that decouples attribute editing from image synthesis, integrating facial priors and semantic relationships for enhanced editing precision.
Findings
Achieves more accurate facial attribute editing on CelebA-HQ.
Better preservation of non-target attributes compared to state-of-the-art methods.
Demonstrates superior qualitative and quantitative results in experiments.
Abstract
Facial attribute editing aims to modify target attributes while preserving attribute-irrelevant content and overall image fidelity. Existing GAN-based methods provide favorable controllability, but often suffer from weak alignment between style codes and attribute semantics. Diffusion-based methods can synthesize highly realistic images; however, their editing precision is limited by the entanglement of semantic directions among different attributes. In this paper, we propose AttDiff-GAN, a hybrid framework that combines GAN-based attribute manipulation with diffusion-based image generation. A key challenge in such integration lies in the inconsistency between one-step adversarial learning and multi-step diffusion denoising, which makes effective optimization difficult. To address this issue, we decouple attribute editing from image synthesis by introducing a feature-level adversarial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
