Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models
Yichen Sun, Zhixuan Chu, Zhan Qin, Kui Ren

TL;DR
This paper proposes a unified diffusion-based framework that enhances text-to-image generation by improving alignment with input descriptions through knowledge graphs and controllable models, significantly reducing inconsistencies.
Contribution
It introduces a novel framework combining LLMs, knowledge graphs, and controllable diffusion models to address image-text inconsistency in T2I generation.
Findings
Improved image-text alignment demonstrated on a multimodal hallucination benchmark.
Effective object location prediction enhances image generation accuracy.
Framework reduces content contradictions in generated images.
Abstract
The rapid advancement of Text-to-Image(T2I) generative models has enabled the synthesis of high-quality images guided by textual descriptions. Despite this significant progress, these models are often susceptible in generating contents that contradict the input text, which poses a challenge to their reliability and practical deployment. To address this problem, we introduce a novel diffusion-based framework to significantly enhance the alignment of generated images with their corresponding descriptions, addressing the inconsistency between visual output and textual input. Our framework is built upon a comprehensive analysis of inconsistency phenomena, categorizing them based on their manifestation in the image. Leveraging a state-of-the-art large language module, we first extract objects and construct a knowledge graph to predict the locations of these objects in potentially generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Reservoir Engineering and Simulation Methods · AI in cancer detection
