Point-wise mutual information-based video segmentation with high temporal consistency
Margret Keuper, Thomas Brox

TL;DR
This paper introduces a PMI-based method for video segmentation that ensures high temporal boundary consistency without relying on optical flow or learned motion models, outperforming existing methods.
Contribution
It presents a novel PMI-based approach for temporally consistent boundary detection in videos, independent of optical flow or motion learning.
Findings
Outperforms state-of-the-art in standard region metrics
Ensures high temporal boundary consistency
Does not depend on optical flow or learned motion models
Abstract
In this paper, we tackle the problem of temporally consistent boundary detection and hierarchical segmentation in videos. While finding the best high-level reasoning of region assignments in videos is the focus of much recent research, temporal consistency in boundary detection has so far only rarely been tackled. We argue that temporally consistent boundaries are a key component to temporally consistent region assignment. The proposed method is based on the point-wise mutual information (PMI) of spatio-temporal voxels. Temporal consistency is established by an evaluation of PMI-based point affinities in the spectral domain over space and time. Thus, the proposed method is independent of any optical flow computation or previously learned motion models. The proposed low-level video segmentation method outperforms the learning-based state of the art in terms of standard region metrics.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Human Pose and Action Recognition
