TL;DR
KD-CVG introduces a knowledge-driven framework with a comprehensive knowledge base and modules to improve semantic alignment and motion realism in creative video generation for advertising.
Contribution
It presents a novel knowledge-driven approach with semantic-aware retrieval and multimodal knowledge reference modules to enhance creative video generation.
Findings
KD-CVG outperforms state-of-the-art methods in semantic alignment.
KD-CVG improves motion adaptability and realism in generated videos.
The approach demonstrates significant gains in experimental evaluations.
Abstract
Creative Generation (CG) leverages generative models to automatically produce advertising content that highlights product features, and it has been a significant focus of recent research. However, while CG has advanced considerably, most efforts have concentrated on generating advertising text and images, leaving Creative Video Generation (CVG) relatively underexplored. This gap is largely due to two major challenges faced by Text-to-Video (T2V) models: (a) \textbf{ambiguous semantic alignment}, where models struggle to accurately correlate product selling points with creative video content, and (b) \textbf{inadequate motion adaptability}, resulting in unrealistic movements and distortions. To address these challenges, we develop a comprehensive Advertising Creative Knowledge Base (ACKB) as a foundational resource and propose a knowledge-driven approach (KD-CVG) to overcome the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
