TL;DR
This comprehensive survey reviews deep learning-based visual tracking methods, benchmark datasets, and evaluation metrics, analyzing their characteristics, performance, and challenges to guide future research and practical applications.
Contribution
It systematically investigates DL-based visual tracking techniques, benchmarks, and evaluations, providing a detailed analysis and comparison to inform future research directions.
Findings
Deep learning methods have significantly advanced visual tracking performance.
Benchmark datasets and evaluation metrics are crucial for assessing tracking methods.
Analysis reveals strengths and weaknesses of state-of-the-art trackers under various scenarios.
Abstract
Visual target tracking is one of the most sought-after yet challenging research topics in computer vision. Given the ill-posed nature of the problem and its popularity in a broad range of real-world scenarios, a number of large-scale benchmark datasets have been established, on which considerable methods have been developed and demonstrated with significant progress in recent years -- predominantly by recent deep learning (DL)-based methods. This survey aims to systematically investigate the current DL-based visual tracking methods, benchmark datasets, and evaluation metrics. It also extensively evaluates and analyzes the leading visual tracking methods. First, the fundamental characteristics, primary motivations, and contributions of DL-based methods are summarized from nine key aspects of: network architecture, network exploitation, network training for visual tracking, network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
