Generative Memorize-Then-Recall framework for low bit-rate Surveillance Video Compression
Yaojun Wu, Tianyu He, Zhibo Chen

TL;DR
This paper introduces a novel low bit-rate surveillance video compression method that leverages a memorization-recall framework using spatio-temporal features and skeleton cues, outperforming traditional standards like H.265.
Contribution
It proposes a new framework combining memory, skeleton cues, and generative adversarial networks for efficient surveillance video compression.
Findings
Achieves higher compression efficiency than H.265.
Generates realistic video reconstructions from minimal data.
Utilizes a novel attention mechanism for feature relation modeling.
Abstract
Applications of surveillance video have developed rapidly in recent years to protect public safety and daily life, which often detect and recognize objects in video sequences. Traditional coding frameworks remove temporal redundancy in surveillance video by block-wise motion compensation, lacking the extraction and utilization of inherent structure information. In this paper, we figure out this issue by disentangling surveillance video into the structure of a global spatio-temporal feature (memory) for Group of Picture (GoP) and skeleton for each frame (clue). The memory is obtained by sequentially feeding frame inside GoP into a recurrent neural network, describing appearance for objects that appeared inside GoP. While the skeleton is calculated by a pose estimator, it is regarded as a clue to recall memory. Furthermore, an attention mechanism is introduced to obtain the relation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Video Analysis and Summarization
