D$^{\bf{3}}$: Duplicate Detection Decontaminator for Multi-Athlete Tracking in Sports Videos
Rui He, Zehua Fu, Qingjie Liu, Yunhong Wang, Xunxun Chen

TL;DR
This paper introduces D$^3$, a transformer-based duplicate detection method, and Rally-Hungarian, an algorithm for improved multi-athlete tracking in sports videos, addressing occlusion and duplicate detection challenges.
Contribution
The paper proposes a novel transformer-based duplicate detection decontaminator and a specialized matching algorithm for sports video tracking, along with a new RallyTrack dataset.
Findings
Significant improvement in tracking accuracy (9.2 MOTA, 4.5 HOTA) on RallyTrack.
D$^3$ accelerates training convergence, saving up to 80% training time.
Model trained on volleyball videos effectively generalizes to basketball and soccer.
Abstract
Tracking multiple athletes in sports videos is a very challenging Multi-Object Tracking (MOT) task, since athletes often have the same appearance and are intimately covered with each other, making a common occlusion problem becomes an abhorrent duplicate detection. In this paper, the duplicate detection is newly and precisely defined as occlusion misreporting on the same athlete by multiple detection boxes in one frame. To address this problem, we meticulously design a novel transformer-based Duplicate Detection Decontaminator (D) for training, and a specific algorithm Rally-Hungarian (RH) for matching. Once duplicate detection occurs, D immediately modifies the procedure by generating enhanced boxes losses. RH, triggered by the team sports substitution rules, is exceedingly suitable for sports videos. Moreover, to complement the tracking dataset that without shot changes, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Video Surveillance and Tracking Methods · Human Pose and Action Recognition
