Unsupervised Video Segmentation via Spatio-Temporally Nonlocal Appearance Learning
Kaihua Zhang, Xuejun Li, Qingshan Liu

TL;DR
This paper introduces a novel unsupervised video segmentation method that leverages long-term spatio-temporal nonlocal appearance information, improving segmentation stability amidst rapid motion and shape changes.
Contribution
It proposes a simple approach to mine long-term nonlocal appearance features using superpixels and KD-tree search, enhancing segmentation accuracy over existing methods.
Findings
Outperforms several state-of-the-art methods on SegTrack and Youtube-Objects datasets.
Effectively handles large viewpoint changes and non-rigid deformations.
Demonstrates the benefit of long-term nonlocal appearance information in video segmentation.
Abstract
Video object segmentation is challenging due to the factors like rapidly fast motion, cluttered backgrounds, arbitrary object appearance variation and shape deformation. Most existing methods only explore appearance information between two consecutive frames, which do not make full use of the usefully long-term nonlocal information that is helpful to make the learned appearance stable, and hence they tend to fail when the targets suffer from large viewpoint changes and significant non-rigid deformations. In this paper, we propose a simple yet effective approach to mine the long-term sptatio-temporally nonlocal appearance information for unsupervised video segmentation. The motivation of our algorithm comes from the spatio-temporal nonlocality of the region appearance reoccurrence in a video. Specifically, we first generate a set of superpixels to represent the foreground and background,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
