Persistent Story World Simulation with Continuous Character Customization
Jinlu Zhang, Qiyun Wang, Baoxiang Du, Jiayi Ji, Jing He, Rongsheng Zhang, Tangjie Lv, Xiaoshuai Sun, Rongrong Ji

TL;DR
EverTale is a novel story world simulation framework that enables continuous character customization and high-quality multi-character story visualization through integrated modules and quality control mechanisms.
Contribution
The paper introduces EverTale, a comprehensive story visualization system with a unified character integrator, a quality gate, and a region-focus sampling strategy, advancing multi-character storytelling.
Findings
Outperforms existing methods in story visualization quality
Ensures consistent character fidelity during story progression
Efficiently manages multi-character interactions and layout conflicts
Abstract
Story visualization has gained increasing attention in computer vision. However, current methods often fail to achieve a synergy between accurate character customization, semantic alignment, and continuous integration of new identities. To tackle this challenge, in this paper we present EverTale, a story world simulator for continuous story character customization. We first propose an All-in-One-World Character Integrator to achieve continuous character adaptation within unified LoRA module, eliminating the need for per-character optimization modules of previous methods. Then, we incorporate a Character Quality Gate via MLLM-as-Judge to ensure the fidelity of each character adaptation process through chain-of-thought reasoning, determining whether the model can proceed to the next character or require additional training on the current one. We also introduce a Character-Aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Human Motion and Animation · Generative Adversarial Networks and Image Synthesis
