TCOVIS: Temporally Consistent Online Video Instance Segmentation
Junlong Li, Bingyao Yu, Yongming Rao, Jie Zhou, Jiwen Lu

TL;DR
TCOVIS introduces a novel online video instance segmentation method that leverages global temporal information and spatio-temporal enhancement to achieve state-of-the-art results in real-time scenarios.
Contribution
The paper proposes TCOVIS, an online VIS method with a global instance assignment strategy and spatio-temporal enhancement, improving temporal consistency without offline processing.
Findings
Achieves state-of-the-art performance on multiple VIS benchmarks.
Outperforms previous online methods in temporal consistency.
Demonstrates effectiveness with different backbone architectures.
Abstract
In recent years, significant progress has been made in video instance segmentation (VIS), with many offline and online methods achieving state-of-the-art performance. While offline methods have the advantage of producing temporally consistent predictions, they are not suitable for real-time scenarios. Conversely, online methods are more practical, but maintaining temporal consistency remains a challenging task. In this paper, we propose a novel online method for video instance segmentation, called TCOVIS, which fully exploits the temporal information in a video clip. The core of our method consists of a global instance assignment strategy and a spatio-temporal enhancement module, which improve the temporal consistency of the features from two aspects. Specifically, we perform global optimal matching between the predictions and ground truth across the whole video clip, and supervise the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
