Adaptive Auxiliary Prompt Blending for Target-Faithful Diffusion Generation
Kwanyoung Lee, SeungJu Cha, Yebin Ahn, Hyunwoo Oh, Sungho Koh, Dong-Jin Kim

TL;DR
This paper introduces Adaptive Auxiliary Prompt Blending (AAPB), a training-free method that improves the fidelity and semantic accuracy of diffusion-based text-to-image models, especially for rare concepts and editing tasks.
Contribution
AAPB provides a novel, closed-form, adaptive prompt blending framework grounded in Tweedie's identity, enhancing target-faithful diffusion generation without additional training.
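For context, Tweedie's identity in its standard form is given below. This is the textbook statement, not the paper's derivation, and the symbols ($\bar\alpha_t$, $\epsilon_\theta$, the conditioning $c$) follow the common DDPM convention, which is assumed here rather than taken from the source.

```latex
% Gaussian corruption: y = x + \sigma \varepsilon, \varepsilon \sim \mathcal{N}(0, I)
\mathbb{E}[x \mid y] = y + \sigma^{2}\, \nabla_{y} \log p(y)

% DDPM parameterization (assumed): x_t = \sqrt{\bar\alpha_t}\, x_0 + \sqrt{1-\bar\alpha_t}\, \varepsilon
\mathbb{E}[x_0 \mid x_t]
  = \frac{x_t + (1-\bar\alpha_t)\, \nabla_{x_t} \log p_t(x_t)}{\sqrt{\bar\alpha_t}}
  \approx \frac{x_t - \sqrt{1-\bar\alpha_t}\, \epsilon_\theta(x_t, t, c)}{\sqrt{\bar\alpha_t}}
```

The last step uses the usual correspondence $\epsilon_\theta \approx -\sqrt{1-\bar\alpha_t}\, \nabla_{x_t} \log p_t(x_t)$ between a noise-prediction network and the score.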
Findings
Improves semantic accuracy on the RareBench dataset
Enhances structural fidelity on the FlowEdit dataset
Outperforms prior training-free baselines
Abstract
Diffusion-based text-to-image (T2I) models have made remarkable progress in generating photorealistic and semantically rich images. However, when the target concepts lie in low-density regions of the training distribution, these models often produce semantically misaligned or structurally inconsistent results. This limitation arises from the long-tailed nature of text-image datasets, where rare concepts or editing instructions are underrepresented. To address this, we introduce Adaptive Auxiliary Prompt Blending (AAPB), a unified framework that stabilizes the diffusion process in low-density regions. AAPB leverages auxiliary anchor prompts to provide semantic support in rare concept generation and structural support in image editing, ensuring faithful guidance toward the target prompt. Unlike prior heuristic prompt alternation methods, AAPB derives a closed-form adaptive coefficient…
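To make the idea of blending a target prompt with an auxiliary anchor prompt concrete, here is a minimal sketch, not the authors' implementation. It assumes the blend is a convex combination of two noise predictions and that the clean image is estimated with the usual DDPM noise-prediction formula. The function names (`blend_noise`, `tweedie_x0`) and the fixed coefficient `lam` are hypothetical. The paper's closed-form adaptive coefficient is not given in the excerpt above and is not reproduced here.

```python
import math
from typing import List

Vector = List[float]


def blend_noise(eps_target: Vector, eps_aux: Vector, lam: float) -> Vector:
    """Convex combination of two noise predictions.

    lam = 0 keeps only the target-prompt prediction,
    lam = 1 keeps only the auxiliary-prompt prediction.
    (Assumed blending rule, for illustration only.)
    """
    return [(1.0 - lam) * t + lam * a for t, a in zip(eps_target, eps_aux)]


def tweedie_x0(x_t: Vector, eps: Vector, alpha_bar_t: float) -> Vector:
    """Denoised estimate of x_0 from x_t and a noise prediction,
    under the DDPM parameterization
    x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps.
    """
    scale = math.sqrt(alpha_bar_t)
    noise_scale = math.sqrt(1.0 - alpha_bar_t)
    return [(x - noise_scale * e) / scale for x, e in zip(x_t, eps)]


# Toy data standing in for model outputs (no real diffusion model involved).
x0 = [0.5, -1.0, 2.0]
true_eps = [0.1, 0.3, -0.2]
alpha_bar_t = 0.64
x_t = [
    math.sqrt(alpha_bar_t) * a + math.sqrt(1.0 - alpha_bar_t) * e
    for a, e in zip(x0, true_eps)
]

eps_target = true_eps          # pretend the target prompt predicts the true noise
eps_aux = [0.0, 0.0, 0.0]      # pretend the auxiliary prompt predicts zero noise
lam = 0.25                     # hypothetical fixed coefficient, NOT the paper's adaptive one

eps_blend = blend_noise(eps_target, eps_aux, lam)
x0_hat = tweedie_x0(x_t, eps_blend, alpha_bar_t)
```

In this sketch `lam` is supplied by the caller. An adaptive scheme would instead recompute it at every denoising step.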
Taxonomy
Topics: Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
