DreamRelation: Bridging Customization and Relation Generation
Qingyu Shi, Lu Qi, Jianzong Wu, Jinbin Bai, Jingbo Wang, Yunhai Tong,, Xiangtai Li

TL;DR
DreamRelation is a novel framework that enhances customized image generation by accurately modeling object identities and relationships, addressing pose adjustments and object confusion issues.
Contribution
It introduces a relation-aware generation framework with keypoint matching and local feature modules, improving relation accuracy and object distinction in generated images.
Findings
Outperforms existing models in relation accuracy and identity preservation
Effectively handles pose adjustments and object overlaps
Demonstrates superior results on new relation-specific benchmarks
Abstract
Customized image generation is essential for creating personalized content based on user prompts, allowing large-scale text-to-image diffusion models to more effectively meet individual needs. However, existing models often neglect the relationships between customized objects in generated images. In contrast, this work addresses this gap by focusing on relation-aware customized image generation, which seeks to preserve the identities from image prompts while maintaining the relationship specified in text prompts. Specifically, we introduce DreamRelation, a framework that disentangles identity and relation learning using a carefully curated dataset. Our training data consists of relation-specific images, independent object images containing identity information, and text prompts to guide relation generation. Then, we propose two key modules to tackle the two main challenges: generating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Human Motion and Animation · Multimodal Machine Learning Applications
MethodsDiffusion · Sparse Evolutionary Training
