Towards LLM-centric Affective Visual Customization via Efficient and Precise Emotion Manipulating

Jiamin Luo; Xuqian Gu; Jingjing Wang; Jiahong Lu

arXiv:2602.18016·cs.CV·February 23, 2026

Towards LLM-centric Affective Visual Customization via Efficient and Precise Emotion Manipulating

Jiamin Luo, Xuqian Gu, Jingjing Wang, Jiahong Lu

PDF

Open Access

TL;DR

This paper introduces an LLM-based affective visual customization framework that efficiently and accurately modifies images' subjective emotions while preserving content, addressing limitations of prior methods that ignored emotional content.

Contribution

It proposes a novel LLM-centric affective visual customization task and introduces the EPEM approach with modules for efficient emotion conversion and precise content retention.

Findings

01

EPEM outperforms state-of-the-art baselines in experiments

02

The approach effectively manipulates subjective emotions in images

03

The method preserves emotion-agnostic content accurately

Abstract

Previous studies on visual customization primarily rely on the objective alignment between various control signals (e.g., language, layout and canny) and the edited images, which largely ignore the subjective emotional contents, and more importantly lack general-purpose foundation models for affective visual customization. With this in mind, this paper proposes an LLM-centric Affective Visual Customization (L-AVC) task, which focuses on generating images within modifying their subjective emotions via Multimodal LLM. Further, this paper contends that how to make the model efficiently align emotion conversion in semantics (named inter-emotion semantic conversion) and how to precisely retain emotion-agnostic contents (named exter-emotion semantic retaining) are rather important and challenging in this L-AVC task. To this end, this paper proposes an Efficient and Precise Emotion…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Visual Attention and Saliency Detection