StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
Jinlu Zhang, Jiji Tang, Rongsheng Zhang, Tangjie Lv, Xiaoshuai Sun

TL;DR
StoryWeaver introduces a knowledge graph-based approach for consistent and semantically rich story visualization, effectively maintaining character identity and improving multi-character generation in AI-generated stories.
Contribution
The paper presents a novel Character Graph and knowledge-enhanced spatial guidance to improve character consistency and semantic accuracy in story visualization.
Findings
Achieves +9.03% DINO-I and +13.44% CLIP-T improvements.
Effectively maintains character identity across scenarios.
Outperforms existing methods in visual story coherence.
Abstract
Story visualization has gained increasing attention in artificial intelligence. However, existing methods still struggle with maintaining a balance between character identity preservation and text-semantics alignment, largely due to a lack of detailed semantic modeling of the story scene. To tackle this challenge, we propose a novel knowledge graph, namely Character Graph (\textbf{CG}), which comprehensively represents various story-related knowledge, including the characters, the attributes related to characters, and the relationship between characters. We then introduce StoryWeaver, an image generator that achieve Customization via Character Graph (\textbf{C-CG}), capable of consistent story visualization with rich text semantics. To further improve the multi-character generation performance, we incorporate knowledge-enhanced spatial guidance (\textbf{KE-SG}) into StoryWeaver to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDigital Storytelling and Education · Natural Language Processing Techniques · Video Analysis and Summarization
MethodsSoftmax · Attention Is All You Need
