TIVE: A Toolbox for Identifying Video Instance Segmentation Errors

Wenhe Jia; Lu Yang; Zilong Jia; Wenyi Zhao; Yilin Zhou; Qing Song

arXiv:2210.08856·cs.CV·October 18, 2022

TIVE: A Toolbox for Identifying Video Instance Segmentation Errors

Wenhe Jia, Lu Yang, Zilong Jia, Wenyi Zhao, Yilin Zhou, Qing Song

PDF

Open Access 1 Repo

TL;DR

TIVE is a comprehensive toolbox designed to identify and analyze errors in video instance segmentation models, focusing on spatial and temporal aspects to improve understanding and performance evaluation.

Contribution

The paper introduces TIVE, a novel toolbox that directly analyzes model outputs to classify errors and assess spatial-temporal performance in video instance segmentation.

Findings

01

TIVE effectively distinguishes error types and their impact on mAP.

02

Decomposition of spatial and temporal errors reveals their interplay.

03

Analysis guides improvements in model design and evaluation.

Abstract

Since first proposed, Video Instance Segmentation(VIS) task has attracted vast researchers' focus on architecture modeling to boost performance. Though great advances achieved in online and offline paradigms, there are still insufficient means to identify model errors and distinguish discrepancies between methods, as well approaches that correctly reflect models' performance in recognizing object instances of various temporal lengths remain barely available. More importantly, as the fundamental model abilities demanded by the task, spatial segmentation and temporal association are still understudied in both evaluation and interaction mechanisms. In this paper, we introduce TIVE, a Toolbox for Identifying Video instance segmentation Errors. By directly operating output prediction files, TIVE defines isolated error types and weights each type's damage to mAP, for the purpose of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wenhe-jia/tive
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques