Adaptive Auxiliary Prompt Blending for Target-Faithful Diffusion Generation

Kwanyoung Lee; SeungJu Cha; Yebin Ahn; Hyunwoo Oh; Sungho Koh; Dong-Jin Kim

arXiv:2603.19158·cs.CV·March 20, 2026

Adaptive Auxiliary Prompt Blending for Target-Faithful Diffusion Generation

Kwanyoung Lee, SeungJu Cha, Yebin Ahn, Hyunwoo Oh, Sungho Koh, Dong-Jin Kim

PDF

Open Access

TL;DR

This paper introduces Adaptive Auxiliary Prompt Blending (AAPB), a training-free method that improves the fidelity and semantic accuracy of diffusion-based text-to-image models, especially for rare concepts and editing tasks.

Contribution

AAPB provides a novel, closed-form, adaptive prompt blending framework grounded in Tweedie's identity, enhancing target-faithful diffusion generation without additional training.

Findings

01

Improves semantic accuracy on RareBench dataset

02

Enhances structural fidelity on FlowEdit dataset

03

Outperforms prior training-free baselines

Abstract

Diffusion-based text-to-image (T2I) models have made remarkable progress in generating photorealistic and semantically rich images. However, when the target concepts lie in low-density regions of the training distribution, these models often produce semantically misaligned or structurally inconsistent results. This limitation arises from the long-tailed nature of text-image datasets, where rare concepts or editing instructions are underrepresented. To address this, we introduce Adaptive Auxiliary Prompt Blending (AAPB) - a unified framework that stabilizes the diffusion process in low-density regions. AAPB leverages auxiliary anchor prompts to provide semantic support in rare concept generation and structural support in image editing, ensuring faithful guidance toward the target prompt. Unlike prior heuristic prompt alternation methods, AAPB derives a closed-form adaptive coefficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning