ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation
Fan Yang, Xin Chang, Chenyu Dang, Ziqiang Zheng, Sakriani Sakti,, Satoshi Nakamura, Yang Wu

TL;DR
ReMOTS introduces a self-supervised framework to refine multi-object tracking and segmentation results by adaptively training appearance features and merging tracklets, achieving top performance in the CVPR 2020 MOTS Challenge.
Contribution
It presents a novel self-supervised approach that refines MOTS results through adaptive appearance feature training and automatic threshold determination, improving tracking accuracy.
Findings
Achieved 1st place in CVPR 2020 MOTS Challenge with an sMOTSA score of 69.9.
Demonstrated effective refinement of MOTS results using self-supervised learning.
Improved data association accuracy through adaptive thresholding.
Abstract
We aim to improve the performance of Multiple Object Tracking and Segmentation (MOTS) by refinement. However, it remains challenging for refining MOTS results, which could be attributed to that appearance features are not adapted to target videos and it is also difficult to find proper thresholds to discriminate them. To tackle this issue, we propose a self-supervised refining MOTS (i.e., ReMOTS) framework. ReMOTS mainly takes four steps to refine MOTS results from the data association perspective. (1) Training the appearance encoder using predicted masks. (2) Associating observations across adjacent frames to form short-term tracklets. (3) Training the appearance encoder using short-term tracklets as reliable pseudo labels. (4) Merging short-term tracklets to long-term tracklets utilizing adopted appearance features and thresholds that are automatically obtained from statistical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Image Enhancement Techniques
