AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production
Jiuniu Wang, Zehua Du, Yuyuan Zhao, Bo Yuan, Kexiang Wang, Jian Liang, Yaxi Zhao, Yihen Lu, Gengliang Li, Junlong Gao, Xin Tu, Zhenyu Guo

TL;DR
AesopAgent is an innovative agent-driven system that automates story-to-video production by integrating multimodal content generation, workflow optimization, and coherence mechanisms, achieving state-of-the-art results in visual storytelling.
Contribution
The paper introduces a novel RAG-based evolutionary framework for optimizing multimodal video generation workflows, enhancing coherence and content richness in story-to-video systems.
Findings
Achieves state-of-the-art performance in visual storytelling.
Effectively integrates multimodal content into coherent videos.
Optimizes workflows through iterative evolution and expert knowledge.
Abstract
The Agent and AIGC (Artificial Intelligence Generated Content) technologies have recently made significant progress. We propose AesopAgent, an Agent-driven Evolutionary System on Story-to-Video Production. AesopAgent is a practical application of agent technology for multimodal content generation. The system integrates multiple generative capabilities within a unified framework, so that individual users can leverage these modules easily. This innovative system would convert user story proposals into scripts, images, and audio, and then integrate these multimodal contents into videos. Additionally, the animating units (e.g., Gen-2 and Sora) could make the videos more infectious. The AesopAgent system could orchestrate task workflow for video generation, ensuring that the generated video is both rich in content and coherent. This system mainly contains two layers, i.e., the Horizontal…
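The flow the abstract describes (story proposal to script, then images and audio, then an assembled video) can be pictured with a toy sketch. This is only an illustration under stated assumptions, not the authors' implementation: every function and class name below (`Shot`, `write_script`, `generate_image`, `generate_audio`, `assemble_video`, `story_to_video`) is hypothetical, and the generation steps are stubbed with placeholders that return file names instead of calling real models.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Shot:
    """One script unit plus the assets generated for it (placeholders only)."""
    text: str
    image: str = ""
    audio: str = ""


def write_script(story: str) -> List[Shot]:
    # Hypothetical stand-in for script generation: one shot per sentence.
    sentences = [s.strip() for s in story.split(".") if s.strip()]
    return [Shot(text=s) for s in sentences]


def generate_image(shot: Shot, index: int) -> str:
    # Placeholder for an image-generation call; returns a made-up file name.
    return f"shot_{index:02d}.png"


def generate_audio(shot: Shot, index: int) -> str:
    # Placeholder for a narration/audio call; returns a made-up file name.
    return f"shot_{index:02d}.wav"


def assemble_video(shots: List[Shot]) -> List[Dict[str, str]]:
    # Placeholder for video composition: an ordered timeline of assets.
    return [{"image": s.image, "audio": s.audio, "caption": s.text} for s in shots]


def story_to_video(story: str) -> List[Dict[str, str]]:
    # Run the stages in order: script, per-shot assets, then assembly.
    shots = write_script(story)
    for i, shot in enumerate(shots):
        shot.image = generate_image(shot, i)
        shot.audio = generate_audio(shot, i)
    return assemble_video(shots)


timeline = story_to_video("A fox sees grapes. The fox jumps. The fox walks away.")
for entry in timeline:
    print(entry)
```

Keeping each stage behind its own function means any single stub could be swapped for a real generator without touching the rest of the pipeline, which is one simple way to read the abstract's "unified framework" of modules.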
Taxonomy
Topics: Artificial Intelligence in Games · Digital Games and Media
