Explicitly Modeling the Discriminability for Instance-Aware Visual Object Tracking
Mengmeng Wang, Xiaoqian Yang, and Yong Liu

TL;DR
This paper introduces an Instance-Aware Tracker (IAT) that explicitly models discriminability in feature representations using contrastive learning and negative sample selection, significantly improving visual object tracking performance.
Contribution
The paper proposes a novel IAT with contrastive learning and negative sampling, including video-level and object-level variants, to enhance discriminability in tracking.
Findings
Achieves leading results on 8 benchmark datasets.
Runs at 30FPS, suitable for real-time applications.
Outperforms state-of-the-art methods in accuracy.
Abstract
Visual object tracking performance has been dramatically improved in recent years, but some severe challenges remain open, like distractors and occlusions. We suspect the reason is that the feature representations of the tracking targets are only expressively learned but not fully discriminatively modeled. In this paper, we propose a novel Instance-Aware Tracker (IAT) to explicitly excavate the discriminability of feature representations, which improves the classical visual tracking pipeline with an instance-level classifier. First, we introduce a contrastive learning mechanism to formulate the classification task, ensuring that every training sample could be uniquely modeled and be highly distinguishable from plenty of other samples. Besides, we design an effective negative sample selection scheme to contain various intra and inter classes in the instance classification branch.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Fire Detection and Safety Systems · Human Pose and Action Recognition
MethodsContrastive Learning
