Deep Learning Techniques for Video Instance Segmentation: A Survey
Chenhao Xu, Chang-Tsun Li, Yongjian Hu, Chee Peng Lim, Douglas, Creighton

TL;DR
This survey reviews deep learning methods for video instance segmentation, highlighting architectures, auxiliary techniques, and challenges to guide future research in this rapidly evolving field.
Contribution
It provides a comprehensive overview of deep-learning schemes, comparing their architectures, performance, and complexities, and discusses auxiliary techniques and future challenges.
Findings
Various deep-learning architectures for video instance segmentation are analyzed.
Performance, model complexity, and computational costs are compared across schemes.
Key challenges and future research directions are identified.
Abstract
Video instance segmentation, also known as multi-object tracking and segmentation, is an emerging computer vision research area introduced in 2019, aiming at detecting, segmenting, and tracking instances in videos simultaneously. By tackling the video instance segmentation tasks through effective analysis and utilization of visual information in videos, a range of computer vision-enabled applications (e.g., human action recognition, medical image processing, autonomous vehicle navigation, surveillance, etc) can be implemented. As deep-learning techniques take a dominant role in various computer vision areas, a plethora of deep-learning-based video instance segmentation schemes have been proposed. This survey offers a multifaceted view of deep-learning schemes for video instance segmentation, covering various architectural paradigms, along with comparisons of functional performance,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Video Surveillance and Tracking Methods · Visual Attention and Saliency Detection
