TL;DR
This paper introduces a novel visual tracking method that maintains associations with distractor objects to improve target tracking accuracy, leveraging a learned association network and a hybrid training strategy.
Contribution
It presents a new approach that tracks distractors to enhance target tracking, differing from traditional methods that suppress distractors.
Findings
Achieves state-of-the-art performance on six benchmarks.
Sets a new AUC score of 67.1% on LaSOT.
Gains +5.8% on OxUvA long-term dataset.
Abstract
The presence of objects that are confusingly similar to the tracked target, poses a fundamental challenge in appearance-based visual tracking. Such distractor objects are easily misclassified as the target itself, leading to eventual tracking failure. While most methods strive to suppress distractors through more powerful appearance models, we take an alternative approach. We propose to keep track of distractor objects in order to continue tracking the target. To this end, we introduce a learned association network, allowing us to propagate the identities of all target candidates from frame-to-frame. To tackle the problem of lacking ground-truth correspondences between distractor objects in visual tracking, we propose a training strategy that combines partial annotations with self-supervision. We conduct comprehensive experimental validation and analysis of our approach on several…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
