TIVE: A Toolbox for Identifying Video Instance Segmentation Errors
Wenhe Jia, Lu Yang, Zilong Jia, Wenyi Zhao, Yilin Zhou, Qing Song

TL;DR
TIVE is a comprehensive toolbox designed to identify and analyze errors in video instance segmentation models, focusing on spatial and temporal aspects to improve understanding and performance evaluation.
Contribution
The paper introduces TIVE, a novel toolbox that directly analyzes model outputs to classify errors and assess spatial-temporal performance in video instance segmentation.
Findings
TIVE effectively distinguishes error types and their impact on mAP.
Decomposition of spatial and temporal errors reveals their interplay.
Analysis guides improvements in model design and evaluation.
Abstract
Since first proposed, Video Instance Segmentation(VIS) task has attracted vast researchers' focus on architecture modeling to boost performance. Though great advances achieved in online and offline paradigms, there are still insufficient means to identify model errors and distinguish discrepancies between methods, as well approaches that correctly reflect models' performance in recognizing object instances of various temporal lengths remain barely available. More importantly, as the fundamental model abilities demanded by the task, spatial segmentation and temporal association are still understudied in both evaluation and interaction mechanisms. In this paper, we introduce TIVE, a Toolbox for Identifying Video instance segmentation Errors. By directly operating output prediction files, TIVE defines isolated error types and weights each type's damage to mAP, for the purpose of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
