Correlation-Weighted Multi-Reward Optimization for Compositional Generation
Jungmyung Wi, Hyunsoo Kim, Donghyun Kim

TL;DR
This paper introduces Correlation-Weighted Multi-Reward Optimization (54444), a novel framework that adaptively weights concept rewards based on their correlation to improve compositional text-to-image generation.
Contribution
We propose a correlation-aware reward weighting method that balances competing concepts, enhancing the consistency and quality of multi-concept image generation.
Findings
Improved performance on multi-concept benchmarks
Consistent satisfaction of complex prompts
State-of-the-art results with diffusion models
Abstract
Text-to-image models produce images that align well with natural language prompts, but compositional generation has long been a central challenge. Models often struggle to satisfy multiple concepts within a single prompt, frequently omitting some concepts and resulting in partial success. Such failures highlight the difficulty of jointly optimizing multiple concepts during reward optimization, where competing concepts can interfere with one another. To address this limitation, we propose Correlation-Weighted Multi-Reward Optimization (\ours), a framework that leverages the correlation structure among concept rewards to adaptively weight each attribute concept in optimization. By accounting for interactions among concepts, \ours balances competing reward signals and emphasizes concepts that are partially satisfied yet inconsistently generated across samples, improving compositional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis
