HiGFA: Hierarchical Guidance for Fine-grained Data Augmentation with Diffusion Models

Zhiguang Lu; Qianqian Xu; Peisong Wen; Siran Dai; Qingming Huang

arXiv:2511.12547·cs.CV·December 2, 2025

HiGFA: Hierarchical Guidance for Fine-grained Data Augmentation with Diffusion Models

Zhiguang Lu, Qianqian Xu, Peisong Wen, Siran Dai, Qingming Huang

PDF

Open Access

TL;DR

HiGFA introduces a hierarchical guidance method for diffusion models that improves fine-grained data augmentation by balancing global structure and detailed features, leading to more accurate synthetic images for fine-grained classification.

Contribution

It proposes a novel hierarchical guidance framework that dynamically modulates guidance signals during diffusion sampling for enhanced fine-grained image synthesis.

Findings

01

Improves synthetic image fidelity for fine-grained tasks

02

Enhances classifier performance with augmented data

03

Demonstrates effectiveness across multiple FGVC datasets

Abstract

Generative diffusion models show promise for data augmentation. However, applying them to fine-grained tasks presents a significant challenge: ensuring synthetic images accurately capture the subtle, category-defining features critical for high fidelity. Standard approaches, such as text-based Classifier-Free Guidance (CFG), often lack the required specificity, potentially generating misleading examples that degrade fine-grained classifier performance. To address this, we propose Hierarchically Guided Fine-grained Augmentation (HiGFA). HiGFA leverages the temporal dynamics of the diffusion sampling process. It employs strong text and transformed contour guidance with fixed strengths in the early-to-mid sampling stages to establish overall scene, style, and structure. In the final sampling stages, HiGFA activates a specialized fine-grained classifier guidance and dynamically modulates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications · Domain Adaptation and Few-Shot Learning