Towards Imbalanced Motion: Part-Decoupling Network for Video Portrait Segmentation
Tianshu Yu, Changqun Xia, Jia Li

TL;DR
This paper introduces a new large-scale, complex dataset for video portrait segmentation and proposes a novel Part-Decoupling Network that improves segmentation accuracy by handling imbalanced part motions.
Contribution
The work presents the MVPS dataset with diverse scenes and a new PDNet with IPDA module for better part-aware segmentation in videos.
Findings
MVPS is the most complex VPS dataset to date.
PDNet outperforms state-of-the-art methods in segmentation accuracy.
Part decoupling effectively handles imbalanced motion in portrait parts.
Abstract
Video portrait segmentation (VPS), aiming at segmenting prominent foreground portraits from video frames, has received much attention in recent years. However, simplicity of existing VPS datasets leads to a limitation on extensive research of the task. In this work, we propose a new intricate large-scale Multi-scene Video Portrait Segmentation dataset MVPS consisting of 101 video clips in 7 scenario categories, in which 10,843 sampled frames are finely annotated at pixel level. The dataset has diverse scenes and complicated background environments, which is the most complex dataset in VPS to our best knowledge. Through the observation of a large number of videos with portraits during dataset construction, we find that due to the joint structure of human body, motion of portraits is part-associated, which leads that different parts are relatively independent in motion. That is, motion of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Medical Image Segmentation Techniques
