EmoKGEdit: Training-free Affective Injection via Visual Cue Transformation
Jing Zhang, Bingjie Fan

TL;DR
EmoKGEdit is a training-free framework that enables precise and structure-preserving image emotion editing by leveraging a knowledge graph to disentangle emotional cues from content.
Contribution
It introduces a novel knowledge graph and a disentangled editing module for emotion editing without training, improving fidelity and content preservation.
Findings
Outperforms state-of-the-art methods in emotion fidelity.
Maintains visual spatial coherence during editing.
Achieves high content preservation and emotional accuracy.
Abstract
Existing image emotion editing methods struggle to disentangle emotional cues from latent content representations, often yielding weak emotional expression and distorted visual structures. To bridge this gap, we propose EmoKGEdit, a novel training-free framework for precise and structure-preserving image emotion editing. Specifically, we construct a Multimodal Sentiment Association Knowledge Graph (MSA-KG) to disentangle the intricate relationships among objects, scenes, attributes, visual clues and emotion. MSA-KG explicitly encode the causal chain among object-attribute-emotion, and as external knowledge to support chain of thought reasoning, guiding the multimodal large model to infer plausible emotion-related visual cues and generate coherent instructions. In addition, based on MSA-KG, we design a disentangled structure-emotion editing module that explicitly separates emotional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Emotion and Mood Recognition · Sentiment Analysis and Opinion Mining
