EmoKGEdit: Training-free Affective Injection via Visual Cue Transformation

Jing Zhang; Bingjie Fan

arXiv:2601.12326·cs.CV·January 21, 2026

EmoKGEdit: Training-free Affective Injection via Visual Cue Transformation

Jing Zhang, Bingjie Fan

PDF

Open Access

TL;DR

EmoKGEdit is a training-free framework that enables precise and structure-preserving image emotion editing by leveraging a knowledge graph to disentangle emotional cues from content.

Contribution

It introduces a novel knowledge graph and a disentangled editing module for emotion editing without training, improving fidelity and content preservation.

Findings

01

Outperforms state-of-the-art methods in emotion fidelity.

02

Maintains visual spatial coherence during editing.

03

Achieves high content preservation and emotional accuracy.

Abstract

Existing image emotion editing methods struggle to disentangle emotional cues from latent content representations, often yielding weak emotional expression and distorted visual structures. To bridge this gap, we propose EmoKGEdit, a novel training-free framework for precise and structure-preserving image emotion editing. Specifically, we construct a Multimodal Sentiment Association Knowledge Graph (MSA-KG) to disentangle the intricate relationships among objects, scenes, attributes, visual clues and emotion. MSA-KG explicitly encode the causal chain among object-attribute-emotion, and as external knowledge to support chain of thought reasoning, guiding the multimodal large model to infer plausible emotion-related visual cues and generate coherent instructions. In addition, based on MSA-KG, we design a disentangled structure-emotion editing module that explicitly separates emotional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Emotion and Mood Recognition · Sentiment Analysis and Opinion Mining