EmoFeedback$^2$: Reinforcement of Continuous Emotional Image Generation via LVLM-based Reward and Textual Feedback
Jingyang Jia, Kai Shu, Gang Yang, Long Xing, Xun Chen, Aiping Liu

TL;DR
EmoFeedback$^2$ introduces a reinforcement learning framework utilizing LVLM-based reward and textual feedback to improve continuous emotional image generation, achieving higher emotional fidelity and quality.
Contribution
The paper presents a novel LVLM-based reinforcement learning approach that incorporates emotion-aware rewards and iterative textual feedback for enhanced emotional image generation.
Findings
Outperforms existing methods in emotional fidelity
Generates high-quality images aligned with target emotions
Effectively maintains emotional continuity in images
Abstract
Continuous emotional image generation (C-EICG) is emerging rapidly due to its ability to produce images aligned with both user descriptions and continuous emotional values. However, existing approaches lack emotional feedback from generated images, limiting the control of emotional continuity. Additionally, their simple alignment between emotions and naively generated texts fails to adaptively adjust emotional prompts according to image content, leading to insufficient emotional fidelity. To address these concerns, we propose a novel generation-understanding-feedback reinforcement paradigm (EmoFeedback) for C-EICG, which exploits the reasoning capability of the fine-tuned large vision-language model (LVLM) to provide reward and textual feedback for generating high-quality images with continuous emotions. Specifically, we introduce an emotion-aware reward feedback strategy, where the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Aesthetic Perception and Analysis
