Robust and Real-time Deep Tracking Via Multi-Scale Domain Adaptation
Xinyu Wang, Hanxi Li, Yi Li, Fumin Shen, Fatih Porikli

TL;DR
This paper introduces a real-time deep visual tracker that uses multi-scale domain adaptation through convolutional channel reductions, achieving high speed and state-of-the-art accuracy in benchmark tests.
Contribution
It proposes a novel feature transfer method via convolutional channel reductions for efficient and accurate deep visual tracking.
Findings
Achieves real-time tracking speeds.
Maintains high accuracy comparable to state-of-the-art methods.
Effective multi-scale adaptation improves robustness.
Abstract
Visual tracking is a fundamental problem in computer vision. Recently, some deep-learning-based tracking algorithms have been achieving record-breaking performances. However, due to the high complexity of deep learning, most deep trackers suffer from low tracking speed, and thus are impractical in many real-world applications. Some new deep trackers with smaller network structure achieve high efficiency while at the cost of significant decrease on precision. In this paper, we propose to transfer the feature for image classification to the visual tracking domain via convolutional channel reductions. The channel reduction could be simply viewed as an additional convolutional layer with the specific task. It not only extracts useful information for object tracking but also significantly increases the tracking speed. To better accommodate the useful feature of the target in different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Image Enhancement Techniques · Advanced Vision and Imaging
