Video Primal Sketch: A Unified Middle-Level Representation for Video
Zhi Han, Zongben Xu, Song-Chun Zhu

TL;DR
This paper introduces Video Primal Sketch (VPS), a unified middle-level video representation combining explicit primitives and implicit textured motion models, enabling effective synthesis, reconstruction, and analysis of complex video content.
Contribution
It proposes a hybrid model integrating sparse coding of primitives with feature-statistics-based textured motion models, advancing unified video representation techniques.
Findings
Successfully synthesizes textured motion videos.
Reconstructs real videos with high perceptual quality.
Demonstrates VPS's applicability across scales and action recognition.
Abstract
This paper presents a middle-level video representation named Video Primal Sketch (VPS), which integrates two regimes of models: i) sparse coding model using static or moving primitives to explicitly represent moving corners, lines, feature points, etc., ii) FRAME /MRF model reproducing feature statistics extracted from input video to implicitly represent textured motion, such as water and fire. The feature statistics include histograms of spatio-temporal filters and velocity distributions. This paper makes three contributions to the literature: i) Learning a dictionary of video primitives using parametric generative models; ii) Proposing the Spatio-Temporal FRAME (ST-FRAME) and Motion-Appearance FRAME (MA-FRAME) models for modeling and synthesizing textured motion; and iii) Developing a parsimonious hybrid model for generic video representation. Given an input video, VPS selects the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques
