1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion   Expression guided Video Segmentation

Mingqi Gao; Jingnan Luo; Jinyu Yang; Jungong Han; Feng; Zheng

arXiv:2406.07043·cs.CV·June 12, 2024

1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation

Mingqi Gao, Jingnan Luo, Jinyu Yang, Jungong Han, Feng, Zheng

PDF

Open Access 1 Repo

TL;DR

This paper presents the winning solution for the MeViS track at CVPR 2024, demonstrating effective use of static-dominant data and frame sampling to improve motion and expression-guided video segmentation.

Contribution

The paper introduces a novel approach leveraging static-dominant data and frame sampling techniques for motion and expression-guided video segmentation.

Findings

01

Achieved a J&F score of 0.5447 in the competition

02

Ranked 1st in the MeViS track of PVUW Challenge

03

Validated effectiveness of static-dominant data and frame sampling

Abstract

Motion Expression guided Video Segmentation (MeViS), as an emerging task, poses many new challenges to the field of referring video object segmentation (RVOS). In this technical report, we investigated and validated the effectiveness of static-dominant data and frame sampling on this challenging setting. Our solution achieves a J&F score of 0.5447 in the competition phase and ranks 1st in the MeViS track of the PVUW Challenge. The code is available at: https://github.com/Tapall-AI/MeViS_Track_Solution_2024.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tapall-ai/mevis_track_solution_2024
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing Techniques and Applications