Adaptive Decontamination of the Training Set: A Unified Formulation for Discriminative Visual Tracking
Martin Danelljan, Gustav H\"ager, Fahad Shahbaz Khan, Michael Felsberg

TL;DR
This paper introduces a unified approach for adaptive training set decontamination in visual tracking, jointly estimating sample quality and appearance to improve robustness against corrupted data.
Contribution
It proposes a novel unified formulation that jointly optimizes target appearance and sample quality weights, effectively down-weighting corrupted samples during tracking.
Findings
Significant 3.8% improvement on OTB-2015 benchmark.
Achieved state-of-the-art results on three tracking datasets.
Unified approach outperforms existing methods in handling corrupted samples.
Abstract
Tracking-by-detection methods have demonstrated competitive performance in recent years. In these approaches, the tracking model heavily relies on the quality of the training set. Due to the limited amount of labeled training data, additional samples need to be extracted and labeled by the tracker itself. This often leads to the inclusion of corrupted training samples, due to occlusions, misalignments and other perturbations. Existing tracking-by-detection methods either ignore this problem, or employ a separate component for managing the training set. We propose a novel generic approach for alleviating the problem of corrupted training samples in tracking-by-detection frameworks. Our approach dynamically manages the training set by estimating the quality of the samples. Contrary to existing approaches, we propose a unified formulation by minimizing a single loss over both the target…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Image Enhancement Techniques · Visual Attention and Saliency Detection
