Coherent Loss: A Generic Framework for Stable Video Segmentation
Mingyang Qian, Yi Fu, Xiao Tan, Yingying Li, Jinqing Qi, Huchuan Lu,, Shilei Wen, Errui Ding

TL;DR
This paper introduces Coherent Loss, a generic framework that improves the temporal stability of video segmentation, reducing jittering artifacts and enhancing visual quality without sacrificing accuracy.
Contribution
The paper proposes a novel Coherent Loss framework that enhances temporal stability in video segmentation models, addressing jittering artifacts in practical applications.
Findings
Significant reduction in jittering artifacts in video segmentation results.
Improved visual quality on multiple datasets including DAVIS and Cityscape.
Enhanced model performance with high accuracy and consistency.
Abstract
Video segmentation approaches are of great importance for numerous vision tasks especially in video manipulation for entertainment. Due to the challenges associated with acquiring high-quality per-frame segmentation annotations and large video datasets with different environments at scale, learning approaches shows overall higher accuracy on test dataset but lack strict temporal constraints to self-correct jittering artifacts in most practical applications. We investigate how this jittering artifact degrades the visual quality of video segmentation results and proposed a metric of temporal stability to numerically evaluate it. In particular, we propose a Coherent Loss with a generic framework to enhance the performance of a neural network against jittering artifacts, which combines with high accuracy and high consistency. Equipped with our method, existing video object/semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Image and Video Quality Assessment
