Interpreting and Enhancing Emotional Circuits in Large Vision-Language Models via Cross-Modal Information Flow
Chengsheng Zhang, Chenghao Sun, Zhining Xie, Xinmei Tian

TL;DR
This paper investigates how large vision-language models process emotional information, revealing internal mechanisms and proposing interventions to improve emotional reasoning and reduce hallucinations.
Contribution
It introduces a causal attribution framework and a specialized dataset to analyze and enhance emotional circuits in LVLMs, improving their emotional reasoning capabilities.
Findings
Visual emotional cues are aggregated in middle layers via sentiment-specific attention heads.
Deep layers translate emotional cues into narratives through emotion-general pathways.
Inference-time interventions improve model performance and reduce emotional hallucinations.
Abstract
Large Vision-Language Models (LVLMs) represent a significant leap towards empathetic agents, demonstrating remarkable capabilities in emotion understanding. However, the internal mechanisms governing how LVLMs translate abstract visual stimuli into coherent emotional narratives remain largely unexplored, primarily due to the scarcity of visual counterfactuals and the diffuse nature of emotional expression. In this paper, we bridge this gap by introducing a steering-vector-based causal attribution framework tailored for descriptive emotional reasoning. To this end, we construct a specialized dataset to demystify the emotional circuits underlying the three-stage ``Adapt-Aggregate-Execute'' mechanism. Crucially, we discover a functional decoupling: visual emotional cues are aggregated in middle layers via sentiment-specific attention heads, but are subsequently translated into narrative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
