VidSketch: Hand-drawn Sketch-Driven Video Generation with Diffusion Control
Lifan Jiang, Shuang Chen, Boxi Wu, Xiaotong Guan, Jiahui Zhang

TL;DR
VidSketch is a novel method that enables high-quality, controllable video animation generation from hand-drawn sketches and text prompts, addressing the limitations of static image-focused approaches.
Contribution
It introduces a Level-Based Sketch Control Strategy and a TempSpatial Attention mechanism to improve control and spatiotemporal consistency in video generation from sketches.
Findings
Achieves high-quality video animations from sketches and text prompts.
Enhances spatiotemporal coherence in generated videos.
Provides adjustable control for users with different drawing skills.
Abstract
With the advancement of generative artificial intelligence, previous studies have achieved the task of generating aesthetic images from hand-drawn sketches, fulfilling the public's needs for drawing. However, these methods are limited to static images and lack the ability to control video animation generation using hand-drawn sketches. To address this gap, we propose VidSketch, the first method capable of generating high-quality video animations directly from any number of hand-drawn sketches and simple text prompts, bridging the divide between ordinary users and professional artists. Specifically, our method introduces a Level-Based Sketch Control Strategy to automatically adjust the guidance strength of sketches during the generation process, accommodating users with varying drawing skills. Furthermore, a TempSpatial Attention mechanism is designed to enhance the spatiotemporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Virtual Reality Applications and Impacts · Augmented Reality Applications
MethodsSoftmax · Attention Is All You Need
