TL;DR
This paper introduces SynoClip, a standard dataset for video synopsis, and proposes FGS, a low-computational model that improves efficiency in condensing surveillance videos through novel algorithms.
Contribution
The work provides the first standard dataset for video synopsis and develops a low-cost, effective model with new algorithms for object detection and tube rearrangement.
Findings
SynoClip enables consistent evaluation of video synopsis models.
FGS achieves competitive performance with reduced computational cost.
The proposed algorithms improve the quality and efficiency of video condensation.
Abstract
Video synopsis is an efficient method for condensing surveillance videos. This technique begins with the detection and tracking of objects, followed by the creation of object tubes. These tubes consist of sequences, each containing chronologically ordered bounding boxes of a unique object. To generate a condensed video, the first step involves rearranging the object tubes to maximize the number of non-overlapping objects in each frame. Then, these tubes are stitched to a background image extracted from the source video. The lack of a standard dataset for the video synopsis task hinders the comparison of different video synopsis models. This paper addresses this issue by introducing a standard dataset, called SynoClip, designed specifically for the video synopsis task. SynoClip includes all the necessary features needed to evaluate various models directly and effectively. Additionally,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
