DeepTrack: Learning Discriminative Feature Representations Online for Robust Visual Tracking
Hanxi Li, Yi Li, Fatih Porikli

TL;DR
DeepTrack introduces an efficient online CNN-based tracking method with a novel loss function and sample selection, achieving superior robustness and accuracy in visual tracking tasks.
Contribution
It presents a novel truncated structural loss, a robust sample selection mechanism, and an effective updating scheme for online CNN training in visual tracking.
Findings
Outperforms state-of-the-art trackers on benchmark datasets.
Demonstrates robustness to occlusion and appearance changes.
Achieves significant accuracy improvements over existing methods.
Abstract
Deep neural networks, albeit their great success on feature learning in various computer vision tasks, are usually considered as impractical for online visual tracking because they require very long training time and a large number of training samples. In this work, we present an efficient and very robust tracking algorithm using a single Convolutional Neural Network (CNN) for learning effective feature representations of the target object, in a purely online manner. Our contributions are multifold: First, we introduce a novel truncated structural loss function that maintains as many training samples as possible and reduces the risk of tracking error accumulation. Second, we enhance the ordinary Stochastic Gradient Descent approach in CNN training with a robust sample selection mechanism. The sampling mechanism randomly generates positive and negative samples from different temporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
