EmoSpace: Fine-Grained Emotion Prototype Learning for Immersive Affective Content Generation

Bingyuan Wang; Xingbei Chen; Zongyang Qiu; Linping Yuan; and Zeyu Wang

arXiv:2602.11658·cs.CV·February 13, 2026

EmoSpace: Fine-Grained Emotion Prototype Learning for Immersive Affective Content Generation

Bingyuan Wang, Xingbei Chen, Zongyang Qiu, Linping Yuan, and Zeyu Wang

PDF

Open Access

TL;DR

EmoSpace introduces a novel framework for fine-grained emotion-aware content generation in VR, utilizing dynamic emotion prototypes and hierarchical representations to enable nuanced emotional control without explicit labels.

Contribution

The paper proposes EmoSpace, a new method that learns interpretable emotion prototypes through vision-language alignment for immersive content creation.

Findings

01

Outperforms existing methods in qualitative and quantitative evaluations.

02

Enables diverse applications like emotional image outpainting and VR panorama generation.

03

User study shows VR environments influence emotional perception more than desktop settings.

Abstract

Emotion is important for creating compelling virtual reality (VR) content. Although some generative methods have been applied to lower the barrier to creating emotionally rich content, they fail to capture the nuanced emotional semantics and the fine-grained control essential for immersive experiences. To address these limitations, we introduce EmoSpace, a novel framework for emotion-aware content generation that learns dynamic, interpretable emotion prototypes through vision-language alignment. We employ a hierarchical emotion representation with rich learnable prototypes that evolve during training, enabling fine-grained emotional control without requiring explicit emotion labels. We develop a controllable generation pipeline featuring multi-prototype guidance, temporal blending, and attention reweighting that supports diverse applications, including emotional image outpainting,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Aesthetic Perception and Analysis