3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation
Ruipu Wu, Jifei Che, Han Li, Chengjing Wu, Ting Liu, Luoqi Liu

TL;DR
This paper presents a top-3 solution for video panoptic segmentation in the VIPSeg challenge, enhancing a baseline model with query-wise ensemble and techniques to improve VPQ scores.
Contribution
The paper introduces a novel approach combining query-wise ensemble with additional techniques to improve video panoptic segmentation performance.
Findings
Achieved a VPQ score of 57.01 on VIPSeg test set
Ranked 3rd in the VPS track of the challenge
Enhanced baseline model with ensemble and techniques
Abstract
Video panoptic segmentation is an advanced task that extends panoptic segmentation by applying its concept to video sequences. In the hope of addressing the challenge of video panoptic segmentation in diverse conditions, We utilize DVIS++ as our baseline model and enhance it by introducing a comprehensive approach centered on the query-wise ensemble, supplemented by additional techniques. Our proposed approach achieved a VPQ score of 57.01 on the VIPSeg test set, and ranked 3rd in the VPS track of the 3rd Pixel-level Video Understanding in the Wild Challenge.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptical Systems and Laser Technology · Photoacoustic and Ultrasonic Imaging · Advanced Semiconductor Detectors and Materials
