LIBSVX: A Supervoxel Library and Benchmark for Early Video Processing
Chenliang Xu, Jason J. Corso

TL;DR
This paper evaluates seven supervoxel algorithms across multiple quality metrics and datasets, providing insights into their strengths and suitability for different video analysis tasks.
Contribution
It introduces a comprehensive benchmark and evaluation framework for supervoxel segmentation, comparing multiple algorithms on diverse criteria and datasets.
Findings
GBH captures object boundaries best
SWA has the best potential for region compression
TSP achieves the best undersegmentation error
Abstract
Supervoxel segmentation has strong potential to be incorporated into early video analysis as superpixel segmentation has in image analysis. However, there are many plausible supervoxel methods and little understanding as to when and where each is most appropriate. Indeed, we are not aware of a single comparative study on supervoxel segmentation. To that end, we study seven supervoxel algorithms, including both off-line and streaming methods, in the context of what we consider to be a good supervoxel: namely, spatiotemporal uniformity, object/region boundary detection, region compression and parsimony. For the evaluation we propose a comprehensive suite of seven quality metrics to measure these desirable supervoxel characteristics. In addition, we evaluate the methods in a supervoxel classification task as a proxy for subsequent high-level uses of the supervoxels in video analysis. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
