EmoStory: Emotion-Aware Story Generation
Jingyuan Yang, Rucong Chen, Weibin Luo, Hui Huang

TL;DR
EmoStory introduces an emotion-aware story generation framework that creates coherent visual narratives with explicit emotional directions, enhancing engagement by grounding emotions in visual elements and maintaining subject consistency.
Contribution
This paper presents EmoStory, the first framework for emotion-aware story generation that integrates story planning and region-aware visual synthesis to incorporate explicit emotions.
Findings
Outperforms existing methods in emotion accuracy.
Achieves higher subject consistency in generated stories.
User studies favor EmoStory's emotional and narrative quality.
Abstract
Story generation aims to produce image sequences that depict coherent narratives while maintaining subject consistency across frames. Although existing methods have excelled in producing coherent and expressive stories, they remain largely emotion-neutral, focusing on what subject appears in a story while overlooking how emotions shape narrative interpretation and visual presentation. As stories are intended to engage audiences emotionally, we introduce emotion-aware story generation, a new task that aims to generate subject-consistent visual stories with explicit emotional directions. This task is challenging due to the abstract nature of emotions, which must be grounded in concrete visual elements and consistently expressed across a narrative through visual composition. To address these challenges, we propose EmoStory, a two-stage framework that integrates agent-based story planning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Artificial Intelligence in Games · Generative Adversarial Networks and Image Synthesis
