RAVEL: Rare Concept Generation and Editing via Graph-driven Relational Guidance
Kavana Venkatesh, Yusuf Dalva, Ismini Lourentzou, Pinar Yanardag

TL;DR
RAVEL is a training-free framework that enhances rare concept generation and editing in text-to-image diffusion models by integrating graph-based retrieval and self-correction, improving fidelity and nuance without visual exemplars.
Contribution
RAVEL introduces a novel, training-free, graph-guided retrieval-augmented generation method combined with a self-correction module, advancing rare concept depiction in diffusion models.
Findings
Outperforms state-of-the-art methods on new benchmarks
Improves attribute accuracy, coherence, and semantic fidelity
Compatible with multiple leading diffusion models
Abstract
Despite impressive visual fidelity, current text-to-image (T2I) diffusion models struggle to depict rare, complex, or culturally nuanced concepts due to training data limitations. We introduce RAVEL, a training-free framework that significantly improves rare concept generation, context-driven image editing, and self-correction by integrating graph-based retrieval-augmented generation (RAG) into diffusion pipelines. Unlike prior RAG and LLM-enhanced methods reliant on visual exemplars, static captions or pre-trained knowledge of models, RAVEL leverages structured knowledge graphs to retrieve compositional, symbolic, and relational context, enabling nuanced grounding even in the absence of visual priors. To further refine generation quality, we propose SRD, a novel self-correction module that iteratively updates prompts via multi-aspect alignment feedback, enhancing attribute accuracy,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Healthcare · Image Retrieval and Classification Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Attention Is All You Need · Dense Connections · Byte Pair Encoding · Residual Connection · Multi-Head Attention · Weight Decay · WordPiece · Softmax
