Towards LLM-centric Affective Visual Customization via Efficient and Precise Emotion Manipulating
Jiamin Luo, Xuqian Gu, Jingjing Wang, Jiahong Lu

TL;DR
This paper introduces an LLM-based affective visual customization framework that efficiently and accurately modifies images' subjective emotions while preserving content, addressing limitations of prior methods that ignored emotional content.
Contribution
It proposes a novel LLM-centric affective visual customization task and introduces the EPEM approach with modules for efficient emotion conversion and precise content retention.
Findings
EPEM outperforms state-of-the-art baselines in experiments
The approach effectively manipulates subjective emotions in images
The method preserves emotion-agnostic content accurately
Abstract
Previous studies on visual customization primarily rely on the objective alignment between various control signals (e.g., language, layout and canny) and the edited images, which largely ignore the subjective emotional contents, and more importantly lack general-purpose foundation models for affective visual customization. With this in mind, this paper proposes an LLM-centric Affective Visual Customization (L-AVC) task, which focuses on generating images within modifying their subjective emotions via Multimodal LLM. Further, this paper contends that how to make the model efficiently align emotion conversion in semantics (named inter-emotion semantic conversion) and how to precisely retain emotion-agnostic contents (named exter-emotion semantic retaining) are rather important and challenging in this L-AVC task. To this end, this paper proposes an Efficient and Precise Emotion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Visual Attention and Saliency Detection
