TL;DR
FairyGen automatically generates story-driven cartoon videos from a single child's drawing, combining character style preservation, cinematic shot design, and physically plausible motion reconstruction.
Contribution
FairyGen introduces a pipeline that disentangles character modeling from background style, incorporates cinematic shot design, and reconstructs 3D character motion for personalized story animation.
Findings
Produces stylistically faithful animations
Generates narratively coherent videos
Achieves natural motion in story scenes
Abstract
We propose FairyGen, an automatic system for generating story-driven cartoon videos from a single child's drawing, while faithfully preserving its unique artistic style. Unlike previous storytelling methods that primarily focus on character consistency and basic motion, FairyGen explicitly disentangles character modeling from stylized background generation and incorporates cinematic shot design to support expressive and coherent storytelling. Given a single character sketch, we first employ an MLLM to generate a structured storyboard with shot-level descriptions that specify environment settings, character actions, and camera perspectives. To ensure visual consistency, we introduce a style propagation adapter that captures the character's visual style and applies it to the background, faithfully retaining the character's full visual identity while synthesizing style-consistent scenes. A…
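To make the storyboard step in the abstract concrete, here is a minimal sketch of what a structured storyboard with shot-level descriptions might look like. It assumes the MLLM returns JSON with one entry per shot. The class names `Shot` and `Storyboard`, the field names, and the `parse_storyboard` helper are hypothetical and not taken from the paper, and the JSON string below is a hand-written stand-in for a model response.

```python
import json
from dataclasses import dataclass, field
from typing import List


@dataclass
class Shot:
    """One shot-level description (hypothetical schema, not the paper's)."""
    environment: str  # environment setting for the shot
    action: str       # what the character does
    camera: str       # camera perspective


@dataclass
class Storyboard:
    """A character plus an ordered list of shots."""
    character: str
    shots: List[Shot] = field(default_factory=list)


def parse_storyboard(raw: str) -> Storyboard:
    """Parse an assumed JSON response into a Storyboard.

    Raises KeyError or TypeError if a required field is missing.
    """
    data = json.loads(raw)
    shots = [Shot(**shot) for shot in data["shots"]]
    return Storyboard(character=data["character"], shots=shots)


# Hand-written stand-in for an MLLM response (illustrative values only).
example_response = json.dumps({
    "character": "crayon fox",
    "shots": [
        {"environment": "forest clearing at dawn",
         "action": "wakes up and stretches",
         "camera": "wide shot"},
        {"environment": "riverbank",
         "action": "jumps across stepping stones",
         "camera": "low-angle tracking shot"},
    ],
})

storyboard = parse_storyboard(example_response)
for index, shot in enumerate(storyboard.shots, start=1):
    print(f"Shot {index}: {shot.camera} | {shot.environment} | {shot.action}")
```

Keeping environment, action, and camera as separate fields mirrors the three things the abstract says each shot description specifies, so later stages could read each one independently.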
Taxonomy
Methods: Diffusion · Adapter · Focus
